Functional and Hyperfunctional siRNA

Cross Reference to Related Applications

This application claims the benefit of the filing date of U.S. Provisional 
Application Serial No. 60/426,137, filed November 14, 2002, entitled "Combinatorial 
Pooling Approach for siRNA Induced Gene Silencing and Methods for Selecting 
siRNA," and U.S. Provisional Application Serial No. 60/502,050, filed September 10, 
2003, entitled "Methods for Selecting siRNA," the entire disclosures of which are
hereby incorporated by reference into the present disclosure. 

Field of Invention 

The present invention relates to RNA interference ("RNAi"). 

Background of the Invention

Relatively recently, researchers observed that double stranded RNA 
("dsRNA") could be used to inhibit protein expression. This ability to silence a gene 
has broad potential for treating human diseases, and many researchers and 
commercial entities are currently investing considerable resources in developing
therapies based on this technology. 

Double stranded RNA induced gene silencing can occur on at least three 
different levels: (i) transcription inactivation, which refers to RNA guided DNA or 
histone methylation; (ii) siRNA induced mRNA degradation; and (iii) mRNA induced
transcriptional attenuation. 

It is generally considered that the major mechanism of RNA induced silencing 
(RNA interference, or RNAi) in mammalian cells is mRNA degradation. Initial 
attempts to use RNAi in mammalian cells focused on the use of long strands of
dsRNA. However, these attempts to induce RNAi met with limited success, due in
part to the induction of the interferon response, which results in a general, as opposed 
to a target-specific, inhibition of protein synthesis. Thus, long dsRNA is not a viable 
option for RNAi in mammalian systems. 

More recently it has been shown that when short (18-30 bp) RNA duplexes are
introduced into mammalian cells in culture, sequence-specific inhibition of target
mRNA can be realized without inducing an interferon response. Certain of these
short dsRNAs, referred to as small inhibitory RNAs ("siRNAs"), can act catalytically
at sub-molar concentrations to cleave greater than 95% of the target mRNA in the
cell. The mechanisms of siRNA activity, as well as some of its applications, are
described in Provost et al., Ribonuclease Activity and RNA Binding of
Recombinant Human Dicer, E.M.B.O. J., 2002 Nov. 1; 21(21): 5864-5874; Tabara et
al., The dsRNA Binding Protein RDE-4 Interacts with RDE-1, DCR-1 and a DexH-box
Helicase to Direct RNAi in C. elegans, Cell 2002, June 28; 109(7):861-71; Ketting
et al., Dicer Functions in RNA Interference and in Synthesis of Small RNA Involved in
Developmental Timing in C. elegans; Martinez et al., Single-Stranded Antisense
siRNAs Guide Target RNA Cleavage in RNAi, Cell 2002, Sept. 6; 110(5):563;
Hutvagner & Zamore, A microRNA in a multiple-turnover RNAi enzyme complex,
Science 2002, 297:2056.

From a mechanistic perspective, long double stranded RNA introduced
into plants and invertebrate cells is broken down into siRNA by a Type III
endonuclease known as Dicer. Sharp, RNA interference-2001, Genes Dev. 2001,
15:485. Dicer, a ribonuclease-III-like enzyme, processes the dsRNA into 19-23 base
pair short interfering RNAs with characteristic two base 3' overhangs. Bernstein,
Caudy, Hammond, & Hannon, Role for a bidentate ribonuclease in the initiation step
of RNA interference, Nature 2001, 409:363. The siRNAs are then incorporated into
an RNA-induced silencing complex (RISC) where one or more helicases unwind the
siRNA duplex, enabling the complementary antisense strand to guide target
recognition. Nykanen, Haley, & Zamore, ATP requirements and small interfering
RNA structure in the RNA interference pathway, Cell 2001, 107:309. Upon binding to
the appropriate target mRNA, one or more endonucleases within the RISC cleave the
target to induce silencing. Elbashir, Lendeckel, & Tuschl, RNA interference is
mediated by 21- and 22-nucleotide RNAs, Genes Dev 2001, 15:188, Figure 1.

The interference effect can be long lasting and may be detectable after many 
cell divisions. Moreover, RNAi exhibits sequence specificity. Kisielow, M. et al. 



WO 2004/045543 PCT/US2003/036787 


(2002) Isoform-specific knockdown and expression of adaptor protein ShcA using
small interfering RNA, J. of Biochemistry 363: 1-5. Thus, the RNAi machinery can
specifically knock down one type of transcript, while not affecting closely related
mRNA. These properties make siRNA a potentially valuable tool for inhibiting gene
expression and studying gene function and drug target validation. Moreover, siRNAs
are potentially useful as therapeutic agents against: (1) diseases that are caused by 
over-expression or misexpression of genes; and (2) diseases brought about by 
expression of genes that contain mutations. 

Successful siRNA-dependent gene silencing depends on a number of factors.
One of the most contentious issues in RNAi is the question of the necessity of siRNA
design, i.e., considering the sequence of the siRNA used. Early work in C. elegans
and plants circumvented the issue of design by introducing long dsRNA (see, for
instance, Fire, A. et al. (1998) Nature 391:806-811). In this primitive organism, long
dsRNA molecules are cleaved into siRNA by Dicer, thus generating a diverse
population of duplexes that can potentially cover the entire transcript. While some
fraction of these molecules are non-functional (i.e., induce little or no silencing), one or
more have the potential to be highly functional, thereby silencing the gene of interest
and alleviating the need for siRNA design. Unfortunately, due to the interferon
response, this same approach is unavailable for mammalian systems. While this effect
can be circumvented by bypassing the Dicer cleavage step and directly introducing
siRNA, this tactic carries with it the risk that the chosen siRNA sequence may be
non-functional or semi-functional.

A number of researchers have expressed the view that siRNA design is not a
crucial element of RNAi. On the other hand, others in the field have begun to explore
the possibility that RNAi can be made more efficient by paying attention to the design
of the siRNA. Unfortunately, none of the reported methods have provided a
satisfactory scheme for reliably selecting siRNA with acceptable levels of
functionality. Accordingly, there is a need to develop rational criteria by which to
select siRNA with an acceptable level of functionality, and to identify siRNA that 
have this improved level of functionality, as well as to identify siRNAs that are 
hyperfunctional. 

Summary of the Invention

The present invention is directed to increasing the efficiency of RNAi, 
particularly in mammalian systems. Accordingly, the present invention provides kits, 
siRNAs and methods for increasing siRNA efficacy. 

According to one embodiment, the present invention provides a kit for gene 
silencing, wherein said kit is comprised of a pool of at least two siRNA duplexes, 
each of which is comprised of a sequence that is complementary to a portion of the 
sequence of one or more target messenger RNA. 

According to a second embodiment, the present invention provides a method 
for optimizing RNA interference by using one or more siRNAs that are optimized 
according to a formula (or algorithm) selected from: 
Formula I

Relative functionality of siRNA = -(GC/3) + (AU_{15-19}) - (Tm_{20°C})*3 - (G_{13})*3 - (C_{19})
+ (A_{19})*2 + (A_{3}) + (U_{10}) + (A_{14}) - (U_{5}) - (A_{11})

Formula II

Relative functionality of siRNA = -(GC/3) - (AU_{15-19})*3 - (G_{13})*3 - (C_{19}) + (A_{19})*2
+ (A_{3})

Formula III

Relative functionality of siRNA = -(GC/3) + (AU_{15-19}) - (Tm_{20°C})*3

Formula IV

Relative functionality of siRNA = -GC/2 + (AU_{15-19})/2 - (Tm_{20°C})*2 - (G_{13})*3 - (C_{19})
+ (A_{19})*2 + (A_{3}) + (U_{10}) + (A_{14}) - (U_{5}) - (A_{11})

Formula V

Relative functionality of siRNA = -(G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3}) + (U_{10}) + (A_{14})
- (U_{5}) - (A_{11})

Formula VI

Relative functionality of siRNA = -(G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3})

Formula VII

Relative functionality of siRNA = -(GC/2) + (AU_{15-19})/2 - (Tm_{20°C})*1 - (G_{13})*3 - (C_{19})
+ (A_{19})*3 + (A_{3})*3 + (U_{10})/2 + (A_{14})/2 - (U_{5})/2 - (A_{11})/2



wherein in Formulas I-VII:

Tm_{20°C} = 1 if the Tm is greater than 20°C;

A_{19} = 1 if A is the base at position 19 on the sense strand, otherwise its value is 0;

AU_{15-19} = 0-5 depending on the number of A or U bases on the sense strand at
positions 15-19;

G_{13} = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0;

C_{19} = 1 if C is the base at position 19 of the sense strand, otherwise its value is 0;

GC = the number of G and C bases in the entire sense strand;

A_{3} = 1 if A is the base at position 3 on the sense strand, otherwise its value is 0;

A_{11} = 1 if A is the base at position 11 on the sense strand, otherwise its value is 0;

A_{14} = 1 if A is the base at position 14 on the sense strand, otherwise its value is 0;

U_{10} = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0;

U_{5} = 1 if U is the base at position 5 on the sense strand, otherwise its value is 0; or



Formula VIII

Relative functionality of siRNA =
(-14)*G_{13} - 13*A_{1} - 12*[...]
9*A_{10} - 9*U_{9} - 9*C_{18} - 8*G_{10} - 7*U_{1} - 7*U_{16} - 7*C_{17} - 7*C_{19}
+ 7*U_{17} + 8*A_{2} + 8*A_{4} + 8*A_{5} + 8*C_{4} + 9*G_{8} + 10*A_{7} + 10*U_{18} + 11*A_{19}
+ 11*C_{9} + 15*G_{1} + 18*A_{3} + 19*U_{10} - Tm - 3*(GC_{total}) - 6*(GC_{15-19})
- 30*X; and

Formula IX

Relative functionality of siRNA =
(14.1)*A_{3} + (14.9)*A_{6} + (17.6)*A_{13} + (24.7)*A_{19} + (14.2)*U_{10} + (10.5)*C_{9}
+ (23.9)*G_{1} + (16.3)*G_{2} + (-12.3)*A_{11} + (-19.3)*U_{1} + (-12.1)*U_{2}
+ (-11)*U_{3} + (-15.2)*U_{15} + (-11.3)*U_{16} + (-11.8)*C_{3} + (-17.4)*C_{6}
+ (-10.5)*C_{7} + (-13.7)*G_{13} + (-25.9)*G_{19} - Tm - 3*(GC_{total}) - 6*(GC_{15-19})
- 30*X

wherein

A_{1} = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0;
A_{2} = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0;
A_{3} = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0;
A_{4} = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0;
A_{5} = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0;
A_{6} = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0;
A_{7} = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0;
A_{10} = 1 if A is the base at position 10 of the sense strand, otherwise its value is 0;
A_{11} = 1 if A is the base at position 11 of the sense strand, otherwise its value is 0;
A_{13} = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0;
A_{19} = 1 if A is the base at position 19 of the sense strand, otherwise if another base is
present or the sense strand is only 18 base pairs in length, its value is 0;

C_{3} = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0;
C_{4} = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0;
C_{5} = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0;
C_{6} = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0;
C_{7} = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0;
C_{9} = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0;
C_{17} = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0;
C_{18} = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0;
C_{19} = 1 if C is the base at position 19 of the sense strand, otherwise if another base is
present or the sense strand is only 18 base pairs in length, its value is 0;

G_{1} = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0;
G_{2} = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0;
G_{8} = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0;
G_{10} = 1 if G is the base at position 10 on the sense strand, otherwise its value is 0;
G_{13} = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0;
G_{19} = 1 if G is the base at position 19 of the sense strand, otherwise if another base is
present or the sense strand is only 18 base pairs in length, its value is 0;

U_{1} = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0;
U_{2} = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0;
U_{3} = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0;
U_{4} = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0;
U_{7} = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0;
U_{9} = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0;
U_{10} = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0;
U_{15} = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0;
U_{16} = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0;
U_{17} = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0;
U_{18} = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0;

GC_{15-19} = the number of G and C bases within positions 15-19 of the sense strand
or within positions 15-18 if the sense strand is only 18 base pairs in length;

GC_{total} = the number of G and C bases in the sense strand;

Tm = 100 if the targeting site contains an inverted repeat longer than 4 base pairs,
otherwise its value is 0; and

X = the number of times that the same nucleotide repeats four or more times in a row.
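
The position-dependent terms defined above lend themselves to a direct computation. The following is a minimal Python sketch, not the applicants' implementation, of Formulas V and VI (which use only positional indicator terms) together with helpers for the GC counts and for X. The function names are hypothetical, and two readings are assumptions: a T in the input is read as U, and X is counted as the number of maximal runs of four or more identical nucleotides.

```python
from itertools import groupby


def to_rna(sense):
    """Upper-case the sense strand and read T as U (assumption)."""
    return sense.upper().replace("T", "U")


def indicator(sense, base, position):
    """Return 1 if `base` is at the 1-based `position`, otherwise 0."""
    s = to_rna(sense)
    return 1 if position <= len(s) and s[position - 1] == base else 0


def formula_vi(sense):
    """Formula VI: -(G13)*3 - (C19) + (A19)*2 + (A3)."""
    return (-3 * indicator(sense, "G", 13) - indicator(sense, "C", 19)
            + 2 * indicator(sense, "A", 19) + indicator(sense, "A", 3))


def formula_v(sense):
    """Formula V: the Formula VI terms plus U10, A14, U5 and A11."""
    return (formula_vi(sense)
            + indicator(sense, "U", 10) + indicator(sense, "A", 14)
            - indicator(sense, "U", 5) - indicator(sense, "A", 11))


def gc_count(sense, first=1, last=None):
    """Count G and C bases in 1-based positions first..last (inclusive)."""
    window = to_rna(sense)[first - 1:last]
    return sum(base in "GC" for base in window)


def repeat_count(sense, min_run=4):
    """X, read here as the number of maximal runs of >= min_run identical bases."""
    return sum(1 for _, run in groupby(to_rna(sense))
               if len(list(run)) >= min_run)


example = "GCAUCGAUGUACAAUCGAA"       # hypothetical 19-nt sense strand
print(formula_vi(example))            # 3  (A19 contributes 2, A3 contributes 1)
print(formula_v(example))             # 4  (adds U10 and A14, subtracts A11)
print(gc_count(example, 15, 19))      # 2
```

Because `gc_count` slices the strand, an 18-base sense strand is handled as positions 15-18 without special casing, and `indicator` returns 0 for position 19 in that case, in line with the definitions above.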

According to a third embodiment, the present invention is directed to a kit 
comprised of at least one siRNA that contains a sequence that is optimized according 
to one of the formulas above. Preferably the kit contains at least two optimized
siRNA, each of which comprises a duplex, wherein one strand of each duplex 
comprises at least eighteen contiguous bases that are complementary to a region of a 
target messenger RNA. For mammalian systems, the siRNA preferably comprises 
between 18 and 30 nucleotide base pairs. 

The ability to use the above algorithms, which are not sequence or species
specific, allows for the cost-effective selection of optimized siRNAs for specific
target sequences. Accordingly, there will be both greater efficiency and reliability in
the use of siRNA technologies.

According to a fourth embodiment, the present invention provides a method 
for developing an siRNA algorithm for selecting functional and hyperfunctional 
siRNAs for a given sequence. The method comprises: 
(a) selecting a set of siRNAs;

(b) measuring the gene silencing ability of each siRNA from said set;

(c) determining the relative functionality of each siRNA;

(d) determining the amount of improved functionality by the presence or
absence of at least one variable selected from the group consisting of
the total GC content, melting temperature of the siRNA, GC content at
positions 15-19, the presence or absence of a particular nucleotide at a
particular position and the number of times that the same nucleotide
repeats within a given sequence; and

(e) developing an algorithm using the information of step (d).

According to this embodiment, preferably the set of siRNAs comprises at least
90 siRNAs from at least one gene, more preferably at least 180 siRNAs from at least
two different genes, and most preferably at least 270 and 360 siRNAs from at least
three and four different genes, respectively. Additionally, in step (d) the
determination is made with preferably at least two, more preferably at least three,
even more preferably at least four, and most preferably all of the variables. The
resulting algorithm is not target sequence specific.
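
One simple way to picture step (d) is to compare the average measured silencing of siRNAs that carry a given feature with the average for those that do not. The Python sketch below illustrates that comparison only; the specification does not state which statistic is used, and the function name, the record layout and the data are all hypothetical.

```python
from statistics import mean


def feature_effect(records, has_feature):
    """Mean silencing with the feature minus mean silencing without it.

    `records` is a list of (sense_strand, percent_silencing) pairs.
    Returns None when either group is empty, since no comparison is possible.
    """
    with_feature = [sil for seq, sil in records if has_feature(seq)]
    without_feature = [sil for seq, sil in records if not has_feature(seq)]
    if not with_feature or not without_feature:
        return None
    return mean(with_feature) - mean(without_feature)


# Hypothetical toy data: (sense strand fragment, % silencing).
toy_records = [("GCAUC", 90.0), ("GCAGG", 80.0), ("GCUAA", 40.0), ("GCGAA", 50.0)]

# Feature: an A at position 3 of the sense strand.
a_at_3 = lambda seq: len(seq) >= 3 and seq[2] == "A"

print(feature_effect(toy_records, a_at_3))  # 40.0
```

A positive difference suggests that the feature is associated with better silencing in the tested set, which is the kind of information step (e) would then fold into a weighted formula.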

In a fifth embodiment, the present invention provides rationally designed 
siRNAs identified using the formulas above.

In a sixth embodiment, the present invention is directed to hyperfunctional
siRNA.

For a better understanding of the present invention together with other and
further advantages and embodiments, reference is made to the following description
taken in conjunction with the examples, the scope of which is set forth in the
appended claims.

Brief Description of the Figures 

Figure 1 shows a model for siRNA-RISC interactions. RISC has the ability to interact 
with either end of the siRNA or miRNA molecule. Following binding, the duplex is 
unwound, and the relevant target is identified, cleaved, and released. 

Figure 2 is a representation of the functionality of two hundred and seventy siRNA
duplexes that were generated to target human cyclophilin, human diazepam-binding
inhibitor (DBI), and firefly luciferase.

Figure 3a is a representation of the silencing effect of 30 siRNAs in three different
cell lines, HEK293, DU145, and HeLa. Figure 3b shows the frequency of different
functional groups (>95% silencing (black), >80% silencing (gray), >50% silencing
(dark gray), and <50% silencing (white)) based on GC content. In cases where a given
bar is absent from a particular GC percentage, no siRNA were identified for that
particular group. Figure 3c shows the frequency of different functional groups based
on melting temperature (Tm). Again, each group has four different divisions: >95%
(black), >80% (gray), >50% (dark gray), and <50% (white) silencing.

Figure 4 is a representation of a statistical analysis that revealed correlations between
silencing and five sequence-related properties of siRNA: (A) an A at position 19 of
the sense strand, (B) an A at position 3 of the sense strand, (C) a U at position 10 of
the sense strand, (D) a base other than G at position 13 of the sense strand, and (E) a
base other than C at position 19 of the sense strand. All variables were correlated with
siRNA silencing of firefly luciferase and human cyclophilin. SiRNAs satisfying the
criterion are grouped on the left (Selected) while those that do not are grouped on the
right (Eliminated). The Y-axis is "% Silencing of Control." Each position on the X-axis
represents a unique siRNA.

Figures 5A and 5B are representations of firefly luciferase and cyclophilin siRNA
panels sorted according to functionality and predicted values using Formula VIII. The
siRNA found within the circle represent those that have Formula VIII values
(SMARTscores™) above zero. SiRNA outside the indicated area have calculated
Formula VIII values that are below zero. The Y-axis is "Expression (% Control)." Each
position on the X-axis represents a unique siRNA.

Figure 6A is a representation of the average internal stability profile (AISP) derived
from 270 siRNAs taken from three separate genes (cyclophilin B, DBI and firefly
luciferase). Graphs represent AISP values of highly functional, functional, and
non-functional siRNA. Figure 6B is a comparison between the AISP of naturally derived
GFP siRNA (filled squares) and the AISP of siRNA from cyclophilin B, DBI, and
luciferase having >90% silencing properties (no fill) for the antisense strand. "DG" is
the symbol for ΔG, free energy.

Figure 7 is a histogram showing the differences in duplex functionality upon
introduction of basepair mismatches. The X-axis shows the mismatch introduced into
the siRNA and the position at which it is introduced (e.g., 8C->A reveals that position 8
(which normally has a C) has been changed to an A). The Y-axis is "% Silencing
(Normalized to Control)."

Figure 8a is a histogram that shows the effects of 5' sense and antisense strand
modification with 2'-O-methylation on functionality. Figure 8b is an expression
profile showing a comparison of sense strand off-target effects for IGF1R-3 and
2'-O-methyl IGF1R-3. Sense strand off-targets (lower white box) are not induced when the
5' end of the sense strand is modified with 2'-O-methyl groups (top white box).

Figure 9 shows a graph of SMARTscores™ versus RNAi silencing values for more 
than 360 siRNA directed against 30 different genes. SiRNA to the right of the vertical 
bar represent those siRNA that have desirable SMARTscores™.



Figures 10A-E compare the RNAi of five different genes (SEAP, DBI, PLK,
Firefly Luciferase, and Renilla Luciferase) by varying numbers of randomly selected
siRNA and four rationally designed (SMART-selected) siRNA chosen using the
algorithm described in Formula VIII. In addition, RNAi induced by a pool of the four
SMART-selected siRNA is reported at two different concentrations (100 and 400 nM).
Figure 10F is a comparison between a pool of randomly selected EGFR siRNA (Pool 1) and
a pool of SMART-selected EGFR siRNA (Pool 2). Pool 1, S1-S4 and Pool 2, S1-S4
represent the individual members that made up each respective pool. Note that
numbers for random siRNAs represent the position of the 5' end of the sense strand of
the duplex. The Y-axis represents the % expression of the control(s). The X-axis is
the percent expression of the control.

Figure 11 shows the Western blot results from cells treated with siRNA directed
against twelve different genes involved in the clathrin-dependent endocytosis
pathway (CHC, DynII, CALM, CLCa, CLCb, Eps15, Eps15R, Rab5a, Rab5b, Rab5c,
β2 subunit of AP-2 and EEA.1). SiRNA were selected using Formula VIII. "Pool"
represents a mixture of duplexes 1-4. The concentration of each siRNA in the pool
is 25 nM. Total concentration = 4 x 25 = 100 nM.

Figure 12 is a representation of the gene silencing capabilities of rationally-selected
siRNA directed against ten different genes (human and mouse cyclophilin, C-myc,
human lamin A/C, QB (ubiquinol-cytochrome c reductase core protein I), MEK1 and
MEK2, ATE1 (arginyl-tRNA protein transferase), GAPDH, and Eg5). The Y-axis is
the percent expression of the control. Numbers 1, 2, 3 and 4 represent individual
rationally selected siRNA. "Pool" represents a mixture of the four individual siRNA.

Figure 13 is the sequence of the top ten Bcl2 siRNAs as determined by Formula VIII.
Sequences are listed 5' to 3'.

Figure 14 is the knockdown by the top ten Bcl2 siRNAs at 100 nM concentrations.
The Y-axis represents the amount of expression relative to the non-specific (ns) and
transfection mixture control.



Figure 15 represents a functional walk where siRNA beginning on every other base
pair of a region of the luciferase gene are tested for the ability to silence the luciferase
gene. The Y-axis represents the percent expression relative to a control. The X-axis
represents the position of each individual siRNA.



Figure 16 is a histogram demonstrating the inhibition of target gene expression by 
pools of 2 and 3 siRNA duplexes taken from the walk described in Figure 15. The
Y-axis represents the percent expression relative to control. The X-axis represents the 
position of the first siRNA in paired pools, or trios of siRNA. For instance, the first 
paired pool contains siRNA 1 and 3. The second paired pool contains siRNA 3 and 5. 
Pool 3 (of paired pools) contains siRNA 5 and 7, and so on. 
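
The pool membership described for Figure 16 follows a sliding-window pattern over the walk. As a rough illustration, the Python sketch below reproduces the paired pools named in the text (1 and 3, 3 and 5, 5 and 7); the function name is hypothetical, and the assumption that trios follow the same overlapping pattern is not stated in the text.

```python
def sliding_pools(positions, pool_size):
    """Group consecutive walk positions into overlapping pools of pool_size."""
    return [tuple(positions[i:i + pool_size])
            for i in range(len(positions) - pool_size + 1)]


walk = [1, 3, 5, 7, 9]            # siRNAs beginning on every other base
print(sliding_pools(walk, 2))     # [(1, 3), (3, 5), (5, 7), (7, 9)]
print(sliding_pools(walk, 3))     # [(1, 3, 5), (3, 5, 7), (5, 7, 9)]
```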

Figure 17 is a histogram demonstrating the inhibition of target gene expression by
pools of 4 and 5 siRNA duplexes. The Y-axis represents the percent expression
relative to a control. The X-axis represents the position of the first siRNA in each
pool.

Figure 18 is a histogram demonstrating the inhibition of target gene expression by 
siRNAs that are ten and twenty basepairs apart. The Y-axis represents the percent 
expression relative to a control. The X-axis represents the position of the first siRNA 
in each pool. 

Figure 19 shows that pools of siRNAs (dark gray bar) work as well as (or better than)
the best siRNA in the pool (light gray bar). The Y-axis represents the percent
expression relative to a control. The X-axis represents the position of the first siRNA
in each pool.

Figure 20 shows that the combination of several semi-functional siRNAs (dark gray)
results in a significant improvement of gene expression inhibition over individual
(semi-functional; light gray) siRNA. The Y-axis represents the percent expression
relative to a control.

Figure 21 shows both pools (Library, Lib) and individual siRNAs in inhibition of
gene expression of Beta-Galactosidase, Renilla Luciferase and SEAP (alkaline
phosphatase). Numbers on the X-axis indicate the position of the 5'-most nucleotide
of the sense strand of the duplex. The Y-axis represents the percent expression of
each gene relative to a control. Libraries contain siRNAs that begin at the following
nucleotides: SEAP: Lib 1: 206, 766, 812, 923, Lib 2: 1117, 1280, 1300, 1487, Lib 3:
206, 766, 812, 923, 1117, 1280, 1300, 1487, Lib 4: 206, 812, 1117, 1300, Lib 5: 766,
923, 1280, 1487, Lib 6: 206, 1487; Bgal: Lib 1: 979, 1339, 2029, 2590, Lib 2:
1087, 1783, 2399, 3257, Lib 3: 979, 1783, 2590, 3257, Lib 4: 979, 1087, 1339, 1783,
2029, 2399, 2590, 3257, Lib 5: 979, 1087, 1339, 1783, Lib 6: 2029, 2399, 2590, 3257;
Renilla: Lib 1: 174, 300, 432, 568, Lib 2: 592, 633, 729, 867, Lib 3: 174, 300, 432,
568, 592, 633, 729, 867, Lib 4: 174, 432, 592, 729, Lib 5: 300, 568, 633, 867, Lib 6:
592, 568.

Figure 22 shows the results of an EGFR and TfnR internalization assay when single
gene knockdowns are performed. The Y-axis represents percent internalization
relative to control.

Figure 23 shows the results of an EGFR and TfnR internalization assay when
multiple genes are knocked down (e.g. Rab5a, b, c). The Y-axis represents the
percent internalization relative to control.

Figure 24 shows the simultaneous knockdown of four different genes. SiRNAs
directed against G6PD, GAPDH, PLK, and UBQ were simultaneously introduced into
cells. Twenty-four hours later, cultures were harvested and assayed for mRNA target
levels for each of the four genes. A comparison is made between cells transfected
with individual siRNAs vs. a pool of siRNAs directed against all four genes.

Figure 25 shows the functionality of ten siRNAs at 0.3nM concentrations. 

Detailed Description 
Definitions 

Unless stated otherwise, the following terms and phrases have the meanings
provided below:

siRNA

The term "siRNA" refers to small inhibitory RNA duplexes that induce the
RNA interference (RNAi) pathway. These molecules can vary in length (generally
between 18-30 base pairs) and contain varying degrees of complementarity to their
target mRNA in the antisense strand. Some, but not all, siRNA have unpaired
overhanging bases on the 5' or 3' end of the sense strand and/or the antisense strand.
The term "siRNA" includes duplexes of two separate strands, as well as single strands
that can form hairpin structures comprising a duplex region.

SiRNA may be divided into five (5) groups (non-functional, semi-functional,
functional, highly functional, and hyper-functional) based on the level or degree of
silencing that they induce in cultured cell lines. As used herein, these definitions are
based on a set of conditions where the siRNA is transfected into said cell line at a
concentration of 100 nM and the level of silencing is tested at a time of roughly 24
hours after transfection, and not exceeding 72 hours after transfection. In this context,
"non-functional siRNA" are defined as those siRNA that induce less than 50%
(<50%) target silencing. "Semi-functional siRNA" induce 50-79% target silencing.
"Functional siRNA" are molecules that induce 80-95% gene silencing.
"Highly-functional siRNA" are molecules that induce greater than 95% gene silencing.
"Hyperfunctional siRNA" are a special class of molecules. For purposes of this
document, hyperfunctional siRNA are defined as those molecules that: (1) induce
greater than 95% silencing of a specific target when they are transfected at
subnanomolar concentrations (i.e., less than one nanomolar); and/or (2) induce
functional (or better) levels of silencing for greater than 96 hours. These relative
functionalities (though not intended to be absolutes) may be used to compare siRNAs
to a particular target for applications such as functional genomics, target identification
and therapeutics.
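
The thresholds above can be expressed as a small classifier. The Python sketch below is illustrative only: the function and parameter names are hypothetical, and because the text gives the semi-functional range as 50-79% and the functional range as 80-95%, the sketch assumes that any value of at least 50% but below 80% (for example 79.5%) counts as semi-functional.

```python
def classify_silencing(percent_silencing):
    """Map % silencing (100 nM, roughly 24 h after transfection) to a class."""
    if percent_silencing < 50:
        return "non-functional"
    if percent_silencing < 80:       # assumption: 79-80% treated as semi-functional
        return "semi-functional"
    if percent_silencing <= 95:
        return "functional"
    return "highly functional"


def is_hyperfunctional(silencing_at_subnanomolar=None, silencing_after_96h=None):
    """Apply criteria (1) and/or (2) to whichever measurements are available.

    silencing_at_subnanomolar: % silencing measured below 1 nM, if tested.
    silencing_after_96h: % silencing measured later than 96 hours, if tested.
    """
    potent = silencing_at_subnanomolar is not None and silencing_at_subnanomolar > 95
    durable = silencing_after_96h is not None and silencing_after_96h >= 80
    return potent or durable


print(classify_silencing(87))                        # functional
print(is_hyperfunctional(silencing_after_96h=85))    # True
```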

miRNA 

The term "miRNA" refers to microRNA.



Gene silencing 

The phrase "gene silencing" refers to a process by which the expression of a 
specific gene product is lessened or attenuated. Gene silencing can take place by a 
variety of pathways. Unless specified otherwise, as used herein, gene silencing refers
to decreases in gene product expression that result from RNA interference (RNAi), a
defined, though partially characterized, pathway whereby small inhibitory RNA
(siRNA) act in concert with host proteins (e.g. the RNA induced silencing complex,
RISC) to degrade messenger RNA (mRNA) in a sequence-dependent fashion. The
level of gene silencing can be measured by a variety of means, including, but not
limited to, measurement of transcript levels by Northern Blot Analysis, B-DNA
techniques, transcription-sensitive reporter constructs, expression profiling (e.g. DNA
chips), and related technologies. Alternatively, the level of silencing can be measured
by assessing the level of the protein encoded by a specific gene. This can be
accomplished by performing a number of studies including Western Analysis,
measuring the levels of expression of a reporter protein that has e.g. fluorescent
properties (e.g. GFP) or enzymatic activity (e.g. alkaline phosphatases), or several
other procedures.

Transfection 

The term "transfection" refers to a process by which agents are introduced into 
a cell. The list of agents that can be transfected is large and includes, but is not 
limited to, siRNA, sense and/or anti-sense sequences, DNA encoding one or more 
genes and organized into an expression plasmid, proteins, protein fragments, and
more. There are multiple methods for transfecting agents into a cell including, but not
limited to, electroporation, calcium phosphate-based transfections, DEAE-dextran-
based transfections, lipid-based transfections, molecular conjugate-based transfections
(e.g. polylysine-DNA conjugates), microinjection and others.

Target 

The term "target" is used in a variety of different forms throughout this 
document and is defined by the context in which it is used. "Target mRNA" refers to
a messenger RNA against which a given siRNA can be directed. "Target sequence"
and "target site" refer to a sequence within the mRNA to which the sense strand of an
siRNA shows varying degrees of homology and the antisense strand exhibits varying 
degrees of complementarity. The term "siRNA target" can refer to the gene, mRNA, 
or protein against which an siRNA is directed. Similarly "target silencing" can refer 
to the state of a gene, or the corresponding mRNA or protein. 

Off-target silencing and Off-target interference 

The phrases "off-target silencing" and "off-target interference" are defined as 
degradation of mRNA other than the intended target mRNA due to overlapping and/or 
partial homology with secondary mRNA messages. 

SMARTscore™ 

The term "SMARTscore™" refers to a number determined by applying any of 
Formulas I - IX to a given siRNA sequence. The terms "SMART-selected," 
"rationally selected," and "rational selection" refer to siRNA that have 
been selected on the basis of their SMARTscores™. 

Complementary 

The term "complementary" refers to the ability of polynucleotides to form 
base pairs with one another. Base pairs are typically formed by hydrogen bonds 
between nucleotide units in antiparallel polynucleotide strands. Complementary 
polynucleotide strands can base pair in the Watson-Crick manner (e.g., A to T, A to 
U, C to G), or in any other manner that allows for the formation of duplexes. As 
persons skilled in the art are aware, when using RNA as opposed to DNA, uracil 
rather than thymine is the base that is considered to be complementary to adenosine. 
However, when a U is denoted in the context of the present invention, the ability to 
substitute a T is implied, unless otherwise stated. 

Perfect complementarity or 100% complementarity refers to the situation in 
which each nucleotide unit of one polynucleotide strand can hydrogen bond with a 
nucleotide unit of a second polynucleotide strand. Less than perfect complementarity 
refers to the situation in which some, but not all, nucleotide units of two strands can 
hydrogen bond with each other. For example, for two 20-mers, if only two base pairs 
on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 
10% complementarity. In the same example, if 18 base pairs on each strand can 
hydrogen bond with each other, the polynucleotide strands exhibit 90% 
complementarity. "Substantial complementarity" refers to polynucleotide strands 
exhibiting 79% or greater complementarity, excluding regions of the polynucleotide 
strands, such as overhangs, that are selected so as to be noncomplementary. 
("Substantial similarity" refers to polynucleotide strands exhibiting 79% or greater 
similarity, excluding regions of the polynucleotide strands, such as overhangs, that are 
selected so as not to be similar.) Thus, for example, two polynucleotides of 29 
nucleotide units each, wherein each comprises a di-dT at the 3' terminus such that the 
duplex region spans 27 bases, and wherein 26 of the 27 bases of the duplex region on 
each strand are complementary, are substantially complementary since they are 96.3% 
complementary when excluding the di-dT overhangs. 
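
The percentages used in this definition can be checked with a few lines of code. The following is a minimal Python sketch and not part of the original disclosure: the function names are hypothetical, only Watson-Crick pairs are counted, and both strands are assumed to be supplied 5' to 3' with any overhangs already removed.

```python
# Watson-Crick partners for RNA. T is accepted as a stand-in for U,
# following the convention stated in the definition of "complementary".
_PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}


def percent_complementarity(strand_a: str, strand_b: str) -> float:
    """Percent of positions in two equal-length duplex regions that can pair.

    Both strands are written 5'->3' with overhangs removed, so strand_b is
    reversed before the position-by-position comparison (antiparallel pairing).
    """
    a = strand_a.upper().replace("T", "U")
    b = strand_b.upper().replace("T", "U")[::-1]
    if not a or len(a) != len(b):
        raise ValueError("duplex regions must be non-empty and equal in length")
    paired = sum(1 for x, y in zip(a, b) if _PAIR.get(x) == y)
    return 100.0 * paired / len(a)


def is_substantially_complementary(strand_a: str, strand_b: str) -> bool:
    """Apply the 79% threshold given in the definition above."""
    return percent_complementarity(strand_a, strand_b) >= 79.0
```

For a 27-base duplex region with a single mismatch, the function returns 26/27, or about 96.3%, which is above the 79% threshold.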



Deoxynucleotide 

The term "deoxynucleotide" refers to a nucleotide or polynucleotide lacking a 
hydroxyl group (OH group) at the 2' and/or 3' position of a sugar moiety. Instead, it 
has a hydrogen bonded to the 2' and/or 3' carbon. Within an RNA molecule that 
comprises one or more deoxynucleotides, "deoxynucleotide" refers to the lack of an 
OH group at the 2' position of the sugar moiety, having instead a hydrogen bonded 
directly to the 2' carbon. 

Deoxyribonucleotide 

The terms "deoxyribonucleotide" and "DNA" refer to a nucleotide or 
polynucleotide comprising at least one sugar moiety that has an H, rather than an OH, 
at its 2' and/or 3' position. 

Substantially Similar 

The phrase "substantially similar" refers to a similarity of at least 90% with 
respect to the identity of the bases of the sequence. 

Duplex Region 

The phrase "duplex region" refers to the region in two complementary or 
substantially complementary polynucleotides that form base pairs with one another, 
either by Watson-Crick base pairing or any other manner that allows for a stabilized 
duplex between polynucleotide strands that are complementary or substantially 
complementary. For example, a polynucleotide strand having 21 nucleotide units can 
base pair with another polynucleotide of 21 nucleotide units, yet only 19 bases on 
each strand are complementary or substantially complementary, such that the "duplex 
region" has 19 base pairs. The remaining bases may, for example, exist as 5' and 3' 
overhangs. Further, within the duplex region, 100% complementarity is not required; 
substantial complementarity is allowable within a duplex region. Substantial 
complementarity refers to 79% or greater complementarity. For example, a mismatch 
in a duplex region consisting of 19 base pairs results in 94.7% complementarity, 
rendering the duplex region substantially complementary. 

Nucleotide 

The term "nucleotide" refers to a ribonucleotide or a deoxyribonucleotide or 
modified form thereof, as well as an analog thereof. Nucleotides include species that 
comprise purines, e.g., adenine, hypoxanthine, guanine, and their derivatives and 
analogs, as well as pyrimidines, e.g., cytosine, uracil, thymine, and their derivatives 
and analogs. 



Nucleotide analogs include nucleotides having modifications in the chemical 
structure of the base, sugar and/or phosphate, including, but not limited to, 5-position 
pyrimidine modifications, 8-position purine modifications, modifications at cytosine 
exocyclic amines, and substitution of 5-bromo-uracil; and 2'-position sugar 
modifications, including but not limited to, sugar-modified ribonucleotides in which 
the 2'-OH is replaced by a group such as an H, OR, R, halo, SH, SR, NH2, NHR, 
NR2, or CN, wherein R is an alkyl moiety. Nucleotide analogs are also meant to 
include nucleotides with bases such as inosine, queuosine, xanthine, sugars such as 
2'-methyl ribose, non-natural phosphodiester linkages such as methylphosphonates, 
phosphorothioates and peptides. 

Modified bases refer to nucleotide bases such as, for example, adenine, 
guanine, cytosine, thymine, uracil, xanthine, inosine, and queuosine that have been 
modified by the replacement or addition of one or more atoms or groups. Some 
examples of types of modifications that can comprise nucleotides that are modified 
with respect to the base moieties include, but are not limited to, alkylated, halogenated, 
thiolated, aminated, amidated, or acetylated bases, individually or in combination. 
More specific examples include, for example, 5-propynyluridine, 5-propynylcytidine, 
6-methyladenine, 6-methylguanine, N,N-dimethyladenine, 2-propyladenine, 
2-propylguanine, 2-aminoadenine, 1-methylinosine, 3-methyluridine, 5-methylcytidine, 
5-methyluridine and other nucleotides having a modification at the 5 position, 
5-(2-amino)propyl uridine, 5-halocytidine, 5-halouridine, 4-acetylcytidine, 
1-methyladenosine, 2-methyladenosine, 3-methylcytidine, 6-methyluridine, 
2-methylguanosine, 7-methylguanosine, 2,2-dimethylguanosine, 
5-methylaminoethyluridine, 5-methyloxyuridine, deazanucleotides such as 
7-deaza-adenosine, 6-azouridine, 6-azocytidine, 6-azothymidine, 5-methyl-2-thiouridine, 
other thio bases such as 2-thiouridine and 4-thiouridine and 2-thiocytidine, dihydrouridine, 
pseudouridine, queuosine, archaeosine, naphthyl and substituted naphthyl groups, any 
O- and N-alkylated purines and pyrimidines such as N6-methyladenosine, 
5-methylcarbonylmethyluridine, uridine 5-oxyacetic acid, pyridine-4-one, 
pyridine-2-one, phenyl and modified phenyl groups such as aminophenol or 2,4,6-trimethoxy 
benzene, modified cytosines that act as G-clamp nucleotides, 8-substituted adenines 
and guanines, 5-substituted uracils and thymines, azapyrimidines, 
carboxyhydroxyalkyl nucleotides, carboxyalkylaminoalkyl nucleotides, and 
alkylcarbonylalkylated nucleotides. Modified nucleotides also include those 
nucleotides that are modified with respect to the sugar moiety, as well as nucleotides 
having sugars or analogs thereof that are not ribosyl. For example, the sugar moieties 
may be, or be based on, mannoses, arabinoses, glucopyranoses, galactopyranoses, 
4'-thioribose, and other sugars, heterocycles, or carbocycles. 



The term nucleotide is also meant to include what are known in the art as 
universal bases. By way of example, universal bases include, but are not limited to, 
3-nitropyrrole, 5-nitroindole, or nebularine. The term "nucleotide" is also meant to 
include the N3' to P5' phosphoramidate, resulting from the substitution of a ribosyl 3' 
oxygen with an amine group. 

Further, the term nucleotide also includes those species that have a detectable 
label, such as for example a radioactive or fluorescent moiety, or mass label attached 
to the nucleotide. 



Polynucleotide 

The term "polynucleotide" refers to polymers of nucleotides, and includes but 
is not limited to DNA, RNA, DNA/RNA hybrids including polynucleotide chains of 
regularly and/or irregularly alternating deoxyribosyl moieties and ribosyl moieties 
(i.e., wherein alternate nucleotide units have an -OH, then an -H, then an -OH, then 
an -H, and so on at the 2' position of a sugar moiety), and modifications of these 
kinds of polynucleotides, wherein the attachment of various entities or moieties to the 
nucleotide units at any position are included. 



Polyribonucleotide 

The term "polyribonucleotide" refers to a polynucleotide comprising two or 
more modified or unmodified ribonucleotides and/or their analogs. The term 
"polyribonucleotide" is used interchangeably with the term "oligoribonucleotide." 

Ribonucleotide and ribonucleic acid 

The term "ribonucleotide" and the phrase "ribonucleic acid" (RNA), refer to a 
modified or unmodified nucleotide or polynucleotide comprising at least one 
ribonucleotide unit. A ribonucleotide unit comprises a hydroxyl group attached to 
the 2' position of a ribosyl moiety that has a nitrogenous base attached in 
N-glycosidic linkage at the 1' position of a ribosyl moiety, and a moiety that either 
allows for linkage to another nucleotide or precludes linkage. 



Detailed Description of the Invention 

The present invention is directed to improving the efficiency of gene silencing 
by siRNA. Through the inclusion of multiple siRNA sequences that are targeted to a 
particular gene and/or selecting an siRNA sequence based on certain defined criteria, 
improved efficiency may be achieved. 

The present invention will now be described in connection with preferred 
embodiments. These embodiments are presented in order to aid in an understanding 
of the present invention and are not intended, and should not be construed, to limit the 
invention in any way. All alternatives, modifications and equivalents that may 
become apparent to those of ordinary skill upon reading this disclosure are included 
within the spirit and scope of the present invention. 

Furthermore, this disclosure is not a primer on RNA interference. Basic 
concepts known to persons skilled in the art have not been set forth in detail. 

Optimizing siRNA 

According to one embodiment, the present invention provides a method for 
improving the effectiveness of gene silencing for use to silence a particular gene 
through the selection of an optimal siRNA. An siRNA selected according to this 
method may be used individually, or in conjunction with the first embodiment, i.e., 
with one or more other siRNAs, each of which may or may not be selected by these 
criteria in order to maximize their efficiency. 



The degree to which it is possible to select an siRNA for a given mRNA that 
maximizes these criteria will depend on the sequence of the mRNA itself. However, 
the selection criteria will be independent of the target sequence. According to this 
method, an siRNA is selected for a given gene by using a rational design. That said, 
rational design can be described in a variety of ways. Rational design is, in simplest 
terms, the application of a proven set of criteria that enhance the probability of 
identifying a functional or hyperfunctional siRNA. In one method, rationally 
designed siRNA can be identified by maximizing one or more of the following 
criteria: 

1. A low GC content, preferably between about 30-52%. 
2. At least 2, preferably at least 3 A or U bases at positions 15-19 of the 
   siRNA on the sense strand. 
3. An A base at position 19 of the sense strand. 
4. An A base at position 3 of the sense strand. 
5. A U base at position 10 of the sense strand. 
6. An A base at position 14 of the sense strand. 
7. A base other than C at position 19 of the sense strand. 
8. A base other than G at position 13 of the sense strand. 
9. A Tm, which refers to the character of the internal repeat that results in 
   inter- or intramolecular structures for one strand of the duplex, that is 
   preferably not stable at greater than 50°C, more preferably not stable at 
   greater than 37°C, even more preferably not stable at greater than 30°C 
   and most preferably not stable at greater than 20°C. 
10. A base other than U at position 5 of the sense strand. 
11. A base other than A at position 11 of the sense strand. 

Criteria 5, 6, 10 and 11 are minor criteria, but are nonetheless desirable. 
Accordingly, preferably an siRNA will satisfy as many of the aforementioned criteria 
as possible, more preferably at least 1-4 and 7-9, and most preferably all of the 
criteria. 

With respect to the criteria, GC content, as well as a high number of AU in 
positions 15-19, may be important for easement of the unwinding of double stranded 
siRNA duplex. Duplex unwinding has been shown to be crucial for siRNA 
functionality in vivo. 

With respect to criterion 9, the internal structure is measured in terms of the 
melting temperature of the single strand of siRNA, which is the temperature at which 
50% of the molecules will become denatured. With respect to criteria 2-8 and 10-11, 
the positions refer to sequence positions on the sense strand, which is the strand 
that is identical to the mRNA. 
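
The sequence-based criteria lend themselves to a mechanical check. The following Python sketch is illustrative only and is not part of the original disclosure: the function name is hypothetical, the GC range of criterion 1 is treated as inclusive at 30% and 52%, and criterion 9 is omitted because evaluating it requires a melting-temperature estimate that the text does not specify how to compute.

```python
def check_sequence_criteria(sense: str) -> dict:
    """Evaluate criteria 1-8, 10 and 11 on a 19-nt sense strand.

    Positions are 1-based and counted along the sense strand, as in the
    criteria list. Returns a dict mapping criterion number to True/False.
    Criterion 9 (Tm) is deliberately not evaluated here.
    """
    s = sense.upper().replace("T", "U")
    if len(s) != 19 or set(s) - set("ACGU"):
        raise ValueError("expected a 19-nt sense strand of A, C, G, U")

    def base(position: int) -> str:
        return s[position - 1]

    gc_percent = 100.0 * sum(b in "GC" for b in s) / len(s)
    au_15_19 = sum(b in "AU" for b in s[14:19])

    return {
        1: 30.0 <= gc_percent <= 52.0,   # assumption: inclusive bounds
        2: au_15_19 >= 2,                # "at least 2" A/U at positions 15-19
        3: base(19) == "A",
        4: base(3) == "A",
        5: base(10) == "U",
        6: base(14) == "A",
        7: base(19) != "C",
        8: base(13) != "G",
        10: base(5) != "U",
        11: base(11) != "A",
    }
```

A caller could, for instance, require criteria 1 and 8, or 7 and 8, to be satisfied, mirroring the preferred embodiments described in the text.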

In one preferred embodiment, at least criteria 1 and 8 are satisfied. In another 
preferred embodiment, at least criteria 7 and 8 are satisfied. In still another preferred 
embodiment, at least criteria 1, 8 and 9 are satisfied. 

It should be noted that all of the aforementioned criteria regarding sequence 
position specifics are with respect to the 5' end of the sense strand. Reference is 
made to the sense strand, because most databases contain information that describes 
the information of the mRNA. Because according to the present invention a chain can 
be from 18 to 30 bases in length, and the aforementioned criteria assume a chain 19 
base pairs in length, it is important to keep the aforementioned criteria applicable to 
the correct bases. 

When there are only 18 bases, the base pair that is not present is the base pair 
that is located at the 3' end of the sense strand. When there are twenty to thirty bases 
present, then additional bases are added at the 5' end of the sense chain and occupy 
positions -1 to -11. Accordingly, with respect to SEQ. ID NO. 0001 
(NNANANNNNUCNAANNNNA) and SEQ. ID NO. 0028 
(GUCNNANANNNNUCNAANNNNA), both would have A at position 3, A at 
position 5, U at position 10, C at position 11, A at position 13, A at position 14 
and A at position 19. However, SEQ. ID NO. 0028 would also have C at position -1, 
U at position -2 and G at position -3. 
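
The numbering convention for strands shorter or longer than 19 bases can be sketched as follows. This Python example is an illustration under stated assumptions, not the original authors' code: the function name is hypothetical, and the 22-base example string is SEQ. ID NO. 0001 with GUC prepended at the 5' end, as the positions listed in the text imply.

```python
def position_map(sense: str) -> dict:
    """Map each base of a sense strand to its criteria position number.

    The criteria assume a 19-base reference. An 18-base strand simply
    lacks position 19 (the 3' end). Bases beyond 19 are treated as 5'
    extensions and numbered -1 (adjacent to position 1), -2, and so on;
    there is no position 0.
    """
    n = len(sense)
    if not 18 <= n <= 30:
        raise ValueError("sense strand must be 18 to 30 bases long")
    extra = max(0, n - 19)  # number of 5' extension bases
    positions = {}
    for i, b in enumerate(sense.upper()):
        # Extension bases get negative numbers; the rest count from 1.
        pos = i - extra if i < extra else i - extra + 1
        positions[pos] = b
    return positions
```

With this mapping, the sequence criteria and the formula variables can be evaluated by position number regardless of strand length.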

For a 19 base pair siRNA, an optimal sequence of one of the strands may be 
represented below, where N is any base, A, C, G, or U: 

SEQ. ID NO. 0001. NNANANNNNUCNAANNNNA 
SEQ. ID NO. 0002. NNANANNNNUGNAANNNNA 
SEQ. ID NO. 0003. NNANANNNNUUNAANNNNA 
SEQ. ID NO. 0004. NNANANNNNUCNCANNNNA 
SEQ. ID NO. 0005. NNANANNNNUGNCANNNNA 
SEQ. ID NO. 0006. NNANANNNNUUNCANNNNA 
SEQ. ID NO. 0007. NNANANNNNUCNUANNNNA 
SEQ. ID NO. 0008. NNANANNNNUGNUANNNNA 
SEQ. ID NO. 0009. NNANANNNNUUNUANNNNA 
SEQ. ID NO. 0010. NNANCNNNNUCNAANNNNA 
SEQ. ID NO. 0011. NNANCNNNNUGNAANNNNA 
SEQ. ID NO. 0012. NNANCNNNNUUNAANNNNA 
SEQ. ID NO. 0013. NNANCNNNNUCNCANNNNA 
SEQ. ID NO. 0014. NNANCNNNNUGNCANNNNA 
SEQ. ID NO. 0015. NNANCNNNNUUNCANNNNA 
SEQ. ID NO. 0016. NNANCNNNNUCNUANNNNA 
SEQ. ID NO. 0017. NNANCNNNNUGNUANNNNA 
SEQ. ID NO. 0018. NNANCNNNNUUNUANNNNA 
SEQ. ID NO. 0019. NNANGNNNNUCNAANNNNA 
SEQ. ID NO. 0020. NNANGNNNNUGNAANNNNA 
SEQ. ID NO. 0021. NNANGNNNNUUNAANNNNA 
SEQ. ID NO. 0022. NNANGNNNNUCNCANNNNA 
SEQ. ID NO. 0023. NNANGNNNNUGNCANNNNA 
SEQ. ID NO. 0024. NNANGNNNNUUNCANNNNA 
SEQ. ID NO. 0025. NNANGNNNNUCNUANNNNA 
SEQ. ID NO. 0026. NNANGNNNNUGNUANNNNA 
SEQ. ID NO. 0027. NNANGNNNNUUNUANNNNA 

In one embodiment, the sequence used as an siRNA is selected by choosing 
the siRNA that scores highest according to one of the following seven algorithms that 
are represented by Formulas I - VII: 

Formula I 

Relative functionality of siRNA = -(GC/3) + (AU15-19) - (Tm20°C)*3 - (G13)*3 - (C19) 
+ (A19)*2 + (A3) + (U10) + (A14) - (U5) - (A11) 

Formula II 

Relative functionality of siRNA = -(GC/3) - (AU15-19)*3 - (G13)*3 - (C19) + (A19)*2 
+ (A3) 

Formula III 

Relative functionality of siRNA = -(GC/3) + (AU15-19) - (Tm20°C)*3 

Formula IV 

Relative functionality of siRNA = -GC/2 + (AU15-19)/2 - (Tm20°C)*2 - (G13)*3 - (C19) 
+ (A19)*2 + (A3) + (U10) + (A14) - (U5) - (A11) 

Formula V 

Relative functionality of siRNA = -(G13)*3 - (C19) + (A19)*2 + (A3) + (U10) + (A14) 
- (U5) - (A11) 

Formula VI 

Relative functionality of siRNA = -(G13)*3 - (C19) + (A19)*2 + (A3) 

Formula VII 

Relative functionality of siRNA = -(GC/2) + (AU15-19)/2 - (Tm20°C)*1 - (G13)*3 - (C19) 
+ (A19)*3 + (A3)*3 + (U10)/2 + (A14)/2 - (U5)/2 - (A11)/2 



In Formulas I - VII: 

wherein A19 = 1 if A is the base at position 19 on the sense strand, otherwise its value 
is 0; 
AU15-19 = 0-5 depending on the number of A or U bases on the sense strand at 
positions 15-19; 
G13 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
C19 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 0; 
GC = the number of G and C bases in the entire sense strand; 
Tm20°C = 1 if the Tm is greater than 20°C; 
A3 = 1 if A is the base at position 3 on the sense strand, otherwise its value is 0; 
U10 = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
A14 = 1 if A is the base at position 14 on the sense strand, otherwise its value is 0; 
U5 = 1 if U is the base at position 5 on the sense strand, otherwise its value is 0; and 
A11 = 1 if A is the base at position 11 of the sense strand, otherwise its value is 0. 



Formulas I - VII provide relative information regarding functionality. When 
the values for two sequences are compared for a given formula, the relative 
functionality is ascertained; a higher positive number indicates a greater functionality. 
For example, in many applications a value of 5 or greater is beneficial. 
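
As an illustration of how such a score is computed, the following Python sketch evaluates Formula I for a 19-base sense strand. It is a hedged example rather than the original implementation: the function name is hypothetical, and because the text does not specify how the Tm of the internal structure is determined, the Tm20°C term is supplied by the caller as a boolean (an assumption made for this sketch).

```python
def formula_i(sense: str, tm_above_20c: bool) -> float:
    """Relative functionality of a 19-nt sense strand per Formula I.

    tm_above_20c is a caller-supplied flag standing in for the Tm20°C
    variable (1 if the Tm is greater than 20°C). How that Tm is measured
    or predicted is outside the scope of this sketch.
    """
    s = sense.upper().replace("T", "U")
    if len(s) != 19:
        raise ValueError("Formula I is written for a 19-base sense strand")

    def flag(base: str, position: int) -> int:
        # 1 if the given base occupies the 1-based position, else 0.
        return 1 if s[position - 1] == base else 0

    gc = sum(b in "GC" for b in s)               # GC: G and C in the strand
    au_15_19 = sum(b in "AU" for b in s[14:19])  # AU15-19: value 0-5
    tm = 1 if tm_above_20c else 0

    return (
        -(gc / 3) + au_15_19 - tm * 3
        - flag("G", 13) * 3 - flag("C", 19)
        + flag("A", 19) * 2 + flag("A", 3)
        + flag("U", 10) + flag("A", 14)
        - flag("U", 5) - flag("A", 11)
    )
```

For the hypothetical strand GUAGAUGAGUCGAAUCUGA with no stable internal structure, the terms are -8/3 + 3 + 2 + 1 + 1 + 1, giving a score of about 5.33, which is above the value of 5 mentioned in the text.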

Additionally, in many applications, more than one of these formulas would 
provide useful information as to the relative functionality of potential siRNA 
sequences. However, it is beneficial to have more than one type of formula, because 
not every formula will be able to help to differentiate among potential siRNA 
sequences. For example, in particularly high GC mRNAs, formulas that take that 
parameter into account would not be useful and application of formulas that lack GC 
elements (e.g., formulas V and VI) might provide greater insights into duplex 
functionality. Similarly, formula II might be used in situations where hairpin 
structures are not observed in duplexes, and formula IV might be applicable for 
sequences that have higher AU content. Thus, one may consider a particular sequence 
in light of more than one or even all of these algorithms to obtain the best 
differentiation among sequences. In some instances, application of a given algorithm 
may identify an unusually large number of potential siRNA sequences, and in those 
cases, it may be appropriate to re-analyze that sequence with a second algorithm that 
is, for instance, more stringent. Alternatively, it is conceivable that analysis of a 
sequence with a given formula yields no acceptable siRNA sequences (i.e., low 
SMARTscores™). In this instance, it may be appropriate to re-analyze that sequence 
with a second algorithm that is, for instance, less stringent. In still other instances, 
analysis of a single sequence with two separate formulas may give rise to conflicting 
results (i.e., one formula generates a set of siRNA with high SMARTscores™ while 
the other formula identifies a set of siRNA with low SMARTscores™). In these 
instances, it may be necessary to determine which weighted factor(s) (e.g., GC 
content) are contributing to the discrepancy and to assess the sequence to decide 
whether these factors should or should not be included. Alternatively, the sequence 
could be analyzed by a third, fourth, or fifth algorithm to identify a set of rationally 
designed siRNA. 
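
The re-analysis workflow described above can be sketched by scoring every 19-base window of an mRNA with one formula and then re-ranking with another. The Python example below is a minimal illustration, not the original program: the function names are hypothetical, and Formulas V and VI are used because they contain only positional terms and so need no GC or Tm input.

```python
def _flag(sense: str, base: str, position: int) -> int:
    """1 if the base occupies the 1-based position of the strand, else 0."""
    return 1 if sense[position - 1] == base else 0


def formula_v(sense: str) -> int:
    """Formula V: positional terms only (no GC or Tm terms)."""
    return (-_flag(sense, "G", 13) * 3 - _flag(sense, "C", 19)
            + _flag(sense, "A", 19) * 2 + _flag(sense, "A", 3)
            + _flag(sense, "U", 10) + _flag(sense, "A", 14)
            - _flag(sense, "U", 5) - _flag(sense, "A", 11))


def formula_vi(sense: str) -> int:
    """Formula VI: the four major positional terms only."""
    return (-_flag(sense, "G", 13) * 3 - _flag(sense, "C", 19)
            + _flag(sense, "A", 19) * 2 + _flag(sense, "A", 3))


def rank_candidates(mrna: str, formula, length: int = 19) -> list:
    """Score every window of the mRNA and sort best-first.

    Returns (score, 1-based start position, window) tuples. The sense
    strand is taken to be identical to the mRNA, as stated in the text.
    """
    seq = mrna.upper().replace("T", "U")
    scored = []
    for start in range(len(seq) - length + 1):
        window = seq[start:start + length]
        scored.append((formula(window), start + 1, window))
    scored.sort(key=lambda c: (-c[0], c[1]))
    return scored
```

Calling `rank_candidates(mrna, formula_v)` and then `rank_candidates(mrna, formula_vi)` on the same sequence gives two rankings that can be compared when one formula alone does not differentiate the candidates.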

The above-referenced criteria are particularly advantageous when used in 
combination with pooling techniques as depicted in Table I: 

Table I 

                        Functional Probability 
              Oligos                      Pools 
Criteria      >95%    >80%    <70%        >95%    >80%     <70% 
Current       33.0    50.0    23.0        79.5    97.3     0.3 
New           50.0    88.5     8.0        93.8    99.98    0.005 
(GC)          28.0    58.9    36.0        72.8    97.1     1.6 




The term "current" refers to Tuschl's conventional siRNA parameters (Elbashir, S.M. 
et al. (2002) "Analysis of gene function in somatic mammalian cells using small 
interfering RNAs" Methods 26: 199-213). "New" refers to the design parameters 
described in Formulas I - VII. "GC" refers to criteria that select siRNA solely on the 
basis of GC content. 

WO 2004/045543 PCT/US2003/036787 



As Table I indicates, when more functional siRNA duplexes are chosen, the 
percentage of siRNAs that produce <70% silencing drops from 23% to 8% and the 
percentage of siRNA duplexes that produce >80% silencing rises from 50% to 88.5%. 
Further, of the siRNA duplexes with >80% silencing, a larger portion of these siRNAs 
actually silence >95% of the target expression (the new criteria increase the portion 
from 33% to 50%). Applying these new criteria to pooled siRNAs shows that, with 
pooling, the proportion of pools silencing >95% increases from 79.5% to 93.8%, and 
pools silencing less than 70% are essentially eliminated. 

Table II similarly shows the particularly beneficial results of pooling in 
combination with the aforementioned criteria. However, Table II, which takes into 
account each of the aforementioned variables, demonstrates even a greater degree of 
improvement in functionality. 



Table II 

                              Functional Probability 
              Oligos                                  Pools 
              Functional  Average  Non-functional     Functional  Average  Non-functional 
Random        20          40       50                 67          97       3 
Criteria 1    52          99       0.1                97          93       0.0040 
Criteria 4    89          99       0.1                99          99       0.0000 




The terms "Functional," "Average," and "Non-functional" refer to siRNA that exhibit 
>80%, >50%, and <50% functionality, respectively. Criteria 1 and 4 refer to specific 
criteria described above. 





The above-described algorithms may be used with or without a computer 
program that allows for the inputting of the sequence of the mRNA and automatically 
outputs the optimal siRNA. The computer program may, for example, be accessible 
from a local terminal or personal computer, over an internal network or over the 
Internet. 

In addition to the formulas above, more detailed algorithms may be used for 
selecting siRNA. Preferably, at least one RNA duplex of between 18 and 30 base 
pairs is selected such that it is optimized according to a formula selected from: 



Formula VIII: (-14)*G13 - 13*A1 - 12*U7 - 11*U2 - 10*A11 - 10*U4 - 10*C3 - 10*C5 - 10*C6 
- 9*A10 - 9*U9 - 9*C18 - 8*G10 - 7*U1 - 7*U16 - 7*C17 - 7*C19 
+ 7*U17 + 8*A2 + 8*A4 + 8*A5 + 8*C4 + 9*G8 + 10*A7 + 10*U18 + 11*A19 
+ 11*C9 + 15*G1 + 18*A3 + 19*U10 - Tm - 3*(GCtotal) - 6*(GC15-19) 
- 30*X; and 


Formula IX: (14.1)*A3 + (14.9)*A6 + (17.6)*A13 + (24.7)*A19 + (14.2)*U10 + (10.5)*C9 
+ (23.9)*G1 + (16.3)*G2 + (-12.3)*A11 + (-19.3)*U1 + (-12.[digit illegible])*U2 
+ (-11)*U3 + (-15.2)*U15 + (-11.3)*U16 + (-11.8)*C3 + (-17.4)*C[subscript illegible] 
+ (-10.5)*C7 + (-13.7)*G13 + (-25.9)*G19 - Tm - 3*(GCtotal) - 6*(GC15-19) 
- 30*X 



wherein 

A1 = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 
A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 
A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
A6 = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 
A7 = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 
A10 = 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 
A11 = 1 if A is the base at position 11 of the sense strand, otherwise its value is 0; 
A13 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
A19 = 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
C6 = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 
C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 
C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
C17 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
C19 = 1 if C is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

G1 = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
G8 = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
G10 = 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
G13 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
G19 = 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

U1 = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
U10 = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
U15 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
U17 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
U18 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

GC15-19 = the number of G and C bases within positions 15-19 of the sense 
strand, or within positions 15-18 if the sense strand is only 18 base pairs in 
length; 
GCtotal = the number of G and C bases in the sense strand; 
Tm = 100 if the siRNA oligo has the internal repeat longer than 4 base pairs, 
otherwise its value is 0; and 
X = the number of times that the same nucleotide repeats four or more times in a 
row. 

The above formulas VIII and IX, as well as formulas I - VII, provide methods 
for selecting siRNA in order to increase the efficiency of gene silencing. A subset of 
variables of any of the formulas may be used, though when fewer variables are used, 
the optimization hierarchy becomes less reliable. 
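
A weighted positional formula of this kind reduces to a lookup table plus a few counts. The Python sketch below illustrates Formula VIII under stated assumptions and is not the original implementation: the names are hypothetical, the weight table is a transcription of the Formula VIII coefficients and should be verified against the formula as filed, the Tm term is supplied by the caller because its determination is not specified here, and only 18- or 19-base strands are accepted to keep the position numbering simple.

```python
import re

# (base, position) -> weight, transcribed from Formula VIII.
# Assumption: verify these coefficients against the formula as filed.
WEIGHTS_VIII = {
    ("G", 13): -14, ("A", 1): -13, ("U", 7): -12, ("U", 2): -11,
    ("A", 11): -10, ("U", 4): -10, ("C", 3): -10, ("C", 5): -10,
    ("C", 6): -10, ("A", 10): -9, ("U", 9): -9, ("C", 18): -9,
    ("G", 10): -8, ("U", 1): -7, ("U", 16): -7, ("C", 17): -7,
    ("C", 19): -7, ("U", 17): 7, ("A", 2): 8, ("A", 4): 8,
    ("A", 5): 8, ("C", 4): 8, ("G", 8): 9, ("A", 7): 10,
    ("U", 18): 10, ("A", 19): 11, ("C", 9): 11, ("G", 1): 15,
    ("A", 3): 18, ("U", 10): 19,
}


def count_homopolymer_runs(sense: str) -> int:
    """X: number of runs in which one nucleotide repeats 4 or more times."""
    return sum(1 for _ in re.finditer(r"(.)\1{3,}", sense))


def formula_viii(sense: str, tm: float = 0) -> float:
    """Score an 18- or 19-nt sense strand with Formula VIII.

    tm is the caller-supplied value of the Tm variable. For an 18-nt
    strand, every position-19 variable contributes zero.
    """
    s = sense.upper().replace("T", "U")
    if len(s) not in (18, 19):
        raise ValueError("this sketch handles 18- or 19-base strands only")
    positional = sum(
        weight
        for (base, pos), weight in WEIGHTS_VIII.items()
        if pos <= len(s) and s[pos - 1] == base
    )
    gc_total = sum(b in "GC" for b in s)
    gc_15_19 = sum(b in "GC" for b in s[14:19])  # 15-18 if only 18 bases
    x = count_homopolymer_runs(s)
    return positional - tm - 3 * gc_total - 6 * gc_15_19 - 30 * x
```

Dropping entries from the weight table corresponds to using a subset of the variables, which, as noted above, makes the ranking less reliable.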

With respect to the variables of the above-referenced formulas, a single letter 
of A or C or G or U followed by a subscript refers to a binary condition. The binary 
condition is that either the particular base is present at that particular position 
(wherein the value is "1") or the base is not present (wherein the value is "0"). 
Because position 19 is optional, i.e., there might be only 18 base pairs, when there are 
only 18 base pairs, any base with a subscript of 19 in the formulas above would have 
a zero value for that parameter. Before or after each variable is a number followed by 
*, which indicates that the value of the variable is to be multiplied or weighted by that 
number. 

The numbers preceding the variables A, or G, or C, or U in Formulas VIII and 
IX (or after the variables in Formulas I - VII) were determined by comparing the 
difference in the frequency of individual bases at different positions in functional 
siRNA and total siRNA. Specifically, the frequency with which a given base was 
observed at a particular position in functional groups was compared with the 
frequency with which that same base was observed in the total, randomly selected 
siRNA set. If the absolute value of the difference between the functional and total 
values was found to be greater than 6%, that parameter was included in the equation. 
Thus, for instance, if the frequency of finding a "G" at position 13 (G13) is found to be 
6% in a given functional group, and the frequency of G13 in the total population of 
siRNAs is 20%, the difference between the two values is 6%-20% = -14%. As the 
absolute value is greater than six (6), this factor (-14) is included in the equation. 
Thus in Formula VIII, in cases where the siRNA under study has a G in position 13, 
the accrued value is (-14) * (1) = -14. In contrast, when a base other than G is found 
at position 13, the accrued value is (-14) * (0) = 0. 
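
The derivation of the weights can be expressed compactly. The following Python sketch is illustrative only: the function name and the dictionary layout are assumptions made for this example, and apart from the G13 figures quoted in the text (6% functional, 20% total), the frequencies in the usage note are hypothetical.

```python
def derive_weights(functional_freq: dict, total_freq: dict,
                   threshold: float = 6.0) -> dict:
    """Derive positional weights from base frequencies given in percent.

    Keys are (base, position) tuples. A term is kept only when the
    absolute difference between the functional-set frequency and the
    total-set frequency exceeds the threshold (6 percentage points in
    the text); the weight is the signed difference.
    """
    weights = {}
    for key in set(functional_freq) | set(total_freq):
        diff = functional_freq.get(key, 0.0) - total_freq.get(key, 0.0)
        if abs(diff) > threshold:
            weights[key] = diff
    return weights
```

For example, with G at position 13 observed in 6% of functional siRNA and 20% of all siRNA, the derived weight is -14, while a hypothetical base with frequencies of 25% and 22% differs by only 3 points and is left out of the equation.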



When developing a means to optimize siRNAs, the inventors observed that a 
bias toward low internal thermodynamic stability of the duplex at the 5'-antisense 
(AS) end is characteristic of naturally occurring miRNA precursors. The inventors 
extended this observation to siRNAs for which functionality had been assessed in 
tissue culture. 

With respect to the parameter GC15-19, a value of 0 - 5 will be ascribed
depending on the number of G or C bases at positions 15 to 19. If there are only 18
base pairs, the value is between 0 and 4.



With respect to the criterion GCtotal content, a number from 0-30 will be
ascribed, which correlates to the total number of G and C nucleotides on the sense
strand, excluding overhangs. Without wishing to be bound by any one theory, it is
postulated that the significance of the GC content (as well as AU content at positions
15-19, which is a parameter for Formulas III - VII) relates to the easement of the
unwinding of a double-stranded siRNA duplex. Duplex unwinding is believed to be
crucial for siRNA functionality in vivo, and overall low internal stability, especially
low internal stability of the first unwound base pair, is believed to be important to
maintain sufficient processivity of RISC complex-induced duplex unwinding. If the
duplex has 19 base pairs, those at positions 15-19 on the sense strand will unwind first
if the molecule exhibits a sufficiently low internal stability at that position. As
persons skilled in the art are aware, RISC is a complex of approximately twelve
proteins; Dicer is one, but not the only, helicase within this complex. Accordingly,
although the GC parameters are believed to relate to activity with Dicer, they are also
important for activity with other RISC proteins.
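
The two GC parameters can be counted directly from the sense strand. The following is
a minimal Python sketch; the function names gc_15_19 and gc_total are hypothetical, and
the input is assumed to be the sense strand written 5' to 3' with overhangs already
removed.

```python
def gc_15_19(sense):
    """Count G/C bases at positions 15-19 (1-based) of the sense strand.

    For an 18-base sequence position 19 is absent, so the result is 0-4.
    """
    return sum(base in "GC" for base in sense.upper()[14:19])


def gc_total(sense):
    """Count all G and C bases on the sense strand (overhangs excluded by the caller)."""
    return sum(base in "GC" for base in sense.upper())


nineteen_mer = "A" * 14 + "GCGCG"
eighteen_mer = "A" * 14 + "GCGC"
print(gc_15_19(nineteen_mer), gc_total(nineteen_mer))
print(gc_15_19(eighteen_mer), gc_total(eighteen_mer))
```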

The value of the parameter Tm is 0 when there are no internal repeats longer
than (or equal to) four base pairs present in the siRNA duplex; otherwise the value is
1. Thus for example, if the sequence ACGUACGU, or any other four nucleotide (or
more) palindrome exists within the structure, the value will be one (1). Alternatively
if the structure ACGGACG, or any other 3 nucleotide (or less) palindrome exists, the
value will be zero (0).

The variable "X" refers to the number of times that the same nucleotide occurs
contiguously in a stretch of four or more units. If there are, for example, four
contiguous As in one part of the sequence and elsewhere in the sequence four
contiguous Cs, X = 2. Further, if there are two separate contiguous stretches of four of
the same nucleotides or eight or more of the same nucleotides in a row, then X = 2.
However, X does not increase for five, six or seven contiguous nucleotides.
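
One way to read the rule for X is that each run of identical nucleotides contributes
one count for every complete block of four. The Python sketch below implements that
reading; the function name homopolymer_x is hypothetical, and the integer-division
interpretation is an assumption drawn from the examples in the preceding paragraph.

```python
from itertools import groupby


def homopolymer_x(sequence):
    """Count complete blocks of four identical, contiguous nucleotides.

    A run of 4-7 identical bases adds 1, a run of 8-11 adds 2, and so on.
    """
    return sum(len(list(run)) // 4 for _, run in groupby(sequence.upper()))


print(homopolymer_x("AAAACGUCCCC"))  # four As and, elsewhere, four Cs
print(homopolymer_x("AAAAAAAA"))     # eight As in a row
print(homopolymer_x("AAAAAAA"))      # seven As in a row
```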

Again, when applying Formula VIII or Formula IX to a given mRNA (the
"target RNA" or "target molecule"), one may use a computer program to evaluate the
criteria for every sequence of 18 - 30 base pairs or only sequences of a fixed length,
e.g., 19 base pairs. Preferably the computer program is designed such that it provides
a report ranking all of the potential siRNAs between 18 and 30 base pairs, ranked
according to which sequences generate the highest value. A higher value refers to a
more efficient siRNA for a particular target gene. The computer program that may be
used may be developed in any computer language that is known to be useful for
scoring nucleotide sequences, or it may be developed with the assistance of a
commercially available product such as Microsoft's product .net. Additionally, rather
than run every sequence through one and/or another formula, one may compare a
subset of the sequences, which may be desirable if for example only a subset are
available. For instance, it may be desirable to first perform a BLAST (Basic Local
Alignment Search Tool) search and to identify sequences that have no homology to
other targets. Alternatively, it may be desirable to scan the sequence and to identify
regions of moderate GC context, then perform relevant calculations using one of the
above-described formulas on these regions. These calculations can be done manually
or with the aid of a computer.
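
A program of the kind described above might enumerate every window of the chosen
length along the target and sort the windows by score. The Python sketch below shows
only that enumeration and ranking step. The names candidate_windows, rank_candidates
and toy_score are hypothetical, and toy_score is a stand-in that is not Formula VIII or
Formula IX; a real scoring function would be passed in its place.

```python
def candidate_windows(target, min_len=19, max_len=19):
    """Yield (1-based start position, subsequence) for every window of each length."""
    target = target.upper()
    for length in range(min_len, max_len + 1):
        for start in range(len(target) - length + 1):
            yield start + 1, target[start:start + length]


def rank_candidates(target, score, min_len=19, max_len=19):
    """Return (score, position, sequence) tuples sorted from highest to lowest score."""
    scored = [(score(seq), pos, seq)
              for pos, seq in candidate_windows(target, min_len, max_len)]
    return sorted(scored, key=lambda item: item[0], reverse=True)


def toy_score(seq):
    """Placeholder scorer (an assumption): counts A/U bases at positions 15-19."""
    return sum(base in "AU" for base in seq[14:19])


ranked = rank_candidates("G" * 19 + "A" * 5, toy_score)
for value, position, sequence in ranked:
    print(value, position, sequence)
```

Passing min_len=18 and max_len=30 would cover the full range of lengths mentioned in
the text, at the cost of a longer report.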

As with Formulas I - VII, either Formula VIII or Formula IX may be used for
a given mRNA target sequence. However, it is possible that according to one or the
other formula more than one siRNA will have the same value. Accordingly, it is
beneficial to have a second formula by which to differentiate sequences. Formula IX
was derived in a similar fashion as Formula VIII, yet used a larger data set and thus
yields sequences with higher statistical correlations to highly functional duplexes. The
sequence that has the highest value ascribed to it may be referred to as a "first
optimized duplex." The sequence that has the second highest value ascribed to it may
be referred to as a "second optimized duplex." Similarly, the sequences that have the
third and fourth highest values ascribed to them may be referred to as a third
optimized duplex and a fourth optimized duplex, respectively. When more than one
sequence has the same value, each of them may, for example, be referred to as first
optimized duplex sequences or co-first optimized duplexes.

siRNA sequences identified using Formula VIII are contained within the
enclosed compact disks. The data included on the enclosed compact disks are
described more fully below. The sequences identified by Formula VIII that are
disclosed in the compact disks may be used in gene silencing applications.

It should be noted that for Formulas VIII and IX all of the aforementioned
criteria are identified as positions on the sense strand when oriented in the 5' to 3'
direction, as they are identified in connection with Formulas I - VII, unless otherwise
specified.

Formulas I - IX may be used to select or to evaluate one, or more than one,
siRNA in order to optimize silencing. Preferably, at least two optimized siRNAs that
have been selected according to at least one of these formulas are used to silence a
gene, more preferably at least three and most preferably at least four. The siRNAs
may be used individually or together in a pool or kit. Further, they may be applied to
a cell simultaneously or separately. Preferably, the at least two siRNAs are applied
simultaneously. Pools are particularly beneficial for many research applications.
However, for therapeutics, it may be more desirable to employ a single
hyperfunctional siRNA as described elsewhere in this application.

When planning to conduct gene silencing, if it is necessary to choose
between two or more siRNAs, one should do so by comparing the relative values
when the siRNAs are subjected to one of the formulas above. In general, a higher
scored siRNA should be used.

Useful applications include, but are not limited to, target validation, gene
functional analysis, research and drug discovery, gene therapy and therapeutics.
Methods for using siRNA in these applications are well known to persons of skill in
the art.

Because the ability of siRNA to function is dependent on the sequence of the
RNA and not the species into which it is introduced, the present invention is
applicable across a broad range of species, including but not limited to all mammalian
species, such as humans, dogs, horses, cats, cows, mice, hamsters, chimpanzees and
gorillas, as well as other species and organisms such as bacteria, viruses, insects,
plants and C. elegans.

The present invention is also applicable for use in silencing a broad range of
genes, including but not limited to the roughly 45,000 genes of a human genome, and
has particular relevance in cases where those genes are associated with diseases such
as diabetes, Alzheimer's and cancer, as well as all genes in the genomes of the
aforementioned organisms.

The siRNA selected according to the aforementioned criteria or one of the 
aforementioned algorithms are also, for example, useful in the simultaneous screening 
and functional analysis of multiple genes and gene families using high throughput 
strategies, as well as in direct gene suppression or silencing. 

Development of the Algorithms 

To identify siRNA sequence features that promote functionality and to
quantify the importance of certain currently accepted conventional factors (such as
G/C content and target site accessibility), the inventors synthesized an siRNA panel
consisting of 270 siRNAs targeting three genes: Human Cyclophilin, Firefly
Luciferase, and Human DBI. In all three cases, siRNAs were directed against specific
regions of each gene. For Human Cyclophilin and Firefly Luciferase, ninety siRNAs
were directed against a 199 bp segment of each respective mRNA. For DBI, 90
siRNAs were directed against a smaller, 109 base pair region of the mRNA. The
sequences to which the siRNAs were directed are provided below.



It should be noted that in certain sequences, "t" is present. This is because
many databases contain information in this manner. However, the t denotes a uracil
residue in mRNA and siRNA. Any algorithm will, unless otherwise specified,
process a t in a sequence as a u.
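
In a program, the t-to-u convention can be applied once, before any scoring. A minimal
Python sketch follows; the function name to_rna is hypothetical.

```python
def to_rna(sequence):
    """Return the sequence upper-cased with every t/T read as U."""
    return sequence.upper().replace("T", "U")


# First sixteen bases of SEQ. ID NO. 29 as listed below.
print(to_rna("gttccaaaaacagtgg"))
```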

Human cyclophilin: 193 - 390, M60857
SEQ. ID NO. 29: 

gttccaaaaacagtggataattttgtggccttagctacaggagagaaaggatttggctacaaaaacagcaaattccatcgtgt 
aatcaaggacttcatgatccagggcggagacttcaccaggggagatggcacaggaggaaagagcatctacggtgagcg 
cttccccgatgagaacttcaaactgaagcactacgggcctggctggg



Firefly luciferase: 1434 - 1631, U47298 (pGL3, Promega)
SEQ. ID NO. 30: 

tgaacttcccgccgccgttgttgttttggagcacggaaagacgatgacggaaaaagagatcgtggattacgtcgccagtca 
agtaacaaccgcgaaaaagttgcgcggaggagttgtgtttgtggacgaagtaccgaaaggtcttaccggaaaactcgacg
caagaaaaatcagagagatcctcataaaggccaagaagg 

DBI, NM_020548 (202 - 310) (every position)
SEQ. ID NO. 31:

acgggcaaggccaagtgggatgcctggaatgagctgaaagggacttccaaggaagatgccatgaaagcttacatcaaca
aagtagaagagctaaagaaaaaatacggg 

A list of the siRNAs appears in Table III (see Examples Section, Example II).

The set of duplexes was analyzed to identify correlations between siRNA
functionality and other biophysical or thermodynamic properties. When the siRNA
panel was analyzed in functional and non-functional subgroups, certain nucleotides
were much more abundant at certain positions in functional or non-functional groups.
More specifically, the frequency of each nucleotide at each position in highly
functional siRNA duplexes was compared with that of nonfunctional duplexes in
order to assess the preference for or against any given nucleotide at every position.
These analyses were used to determine important criteria to be included in the siRNA
algorithms (Formulas VIII and IX).

The data set was also analyzed for distinguishing biophysical properties of
siRNAs in the functional group, such as optimal percent of GC content, propensity for
internal structures and regional thermodynamic stability. Of the presented criteria,
several are involved in duplex recognition, RISC activation/duplex unwinding, and
target cleavage catalysis.



The original data set that was the source of the statistically derived criteria is
shown in Figure 2. Additionally, this figure shows that random selection yields
siRNA duplexes with unpredictable and widely varying silencing potencies as
measured in tissue culture using HEK293 cells. In the figure, duplexes are plotted
such that each x-axis tick-mark represents an individual siRNA, with each subsequent
siRNA differing in target position by two nucleotides for Human Cyclophilin and
Firefly Luciferase, and by one nucleotide for Human DBI. Furthermore, the y-axis
denotes the level of target expression remaining after transfection of the duplex into
cells and subsequent silencing of the target.

siRNA identified and optimized in this document work equally well in a wide
range of cell types. Figure 3a shows the evaluation of thirty siRNAs targeting the
DBI gene in three cell lines derived from different tissues. Each DBI siRNA displays
very similar functionality in HEK293 (ATCC, CRL-1573, human embryonic kidney),
HeLa (ATCC, CCL-2, cervical epithelial adenocarcinoma) and DU145 (HTB-81,
prostate) cells as determined by the B-DNA assay. Thus, siRNA functionality is
determined by the primary sequence of the siRNA and not by the intracellular
environment. Additionally, it should be noted that although the present invention
provides for a determination of the functionality of siRNA for a given target, the same
siRNA may silence more than one gene. For example, the complementary sequence
of the silencing siRNA may be present in more than one gene. Accordingly, in these
circumstances, it may be desirable not to use the siRNA with the highest SMARTscore™.
In such circumstances, it may be desirable to use the siRNA with the next highest
SMARTscore™.



To determine the relevance of G/C content in siRNA function, the G/C content
of each duplex in the panel was calculated and the functional classes of siRNAs
(< F50, > F50, > F80, > F95, where F refers to the percent gene silencing) were sorted
accordingly. The majority of the highly-functional siRNAs (> F95) fell within the
G/C content range of 36% - 52% (Figure 3B). Twice as many non-functional
(< F50) duplexes fell within the high G/C content groups (> 57% GC content) compared
to the 36% - 52% group. The group with extremely low GC content (26% or less)
contained a higher proportion of non-functional siRNAs and no highly-functional
siRNAs. The G/C content range of 30% - 52% was therefore selected as Criterion I
for siRNA functionality, consistent with the observation that a G/C range of 30% - 70%
promotes efficient RNAi targeting. Application of this criterion alone provided only a
marginal increase in the probability of selecting functional siRNAs from the panel:
selection of F50 and F95 siRNAs was improved by 3.6% and 2.2%, respectively. The
siRNA panel presented here permitted a more systematic analysis and quantification
of the importance of this criterion than that used previously.

A relative measure of local internal stability is the A/U base pair (bp) content;
therefore, the frequency of A/U bp was determined for each of the five terminal
positions of the duplex (5' sense (S)/5' antisense (AS)) of all siRNAs in the panel.
Duplexes were then categorized by the number of A/U bp in positions 1 - 5 and
15 - 19 of the sense strand. The thermodynamic flexibility of the duplex 5'-end
(positions 1 - 5; S) did not appear to correlate appreciably with silencing potency,
while that of the 3'-end (positions 15 - 19; S) correlated with efficient silencing. No
duplexes lacking A/U bp in positions 15 - 19 were functional. The presence of one
A/U bp in this region conferred some degree of functionality, but the presence of
three or more A/Us was preferable and therefore defined as Criterion II. When applied
to the test panel, only a marginal increase in the probability of functional siRNA
selection was achieved: a 1.8% and 2.3% increase for F50 and F95 duplexes,
respectively (Table IV).

The complementary strands of siRNAs that contain internal repeats or
palindromes may form internal fold-back structures. These hairpin-like structures
exist in equilibrium with the duplexed form, effectively reducing the concentration of
functional duplexes. The propensity to form internal hairpins and their relative
stability can be estimated by predicted melting temperatures. High Tm reflects a
tendency to form hairpin structures. Lower Tm values indicate a lesser tendency to
form hairpins. When the functional classes of siRNAs were sorted by Tm (Figure 3c),
the following trends were identified: duplexes lacking stable internal repeats were the
most potent silencers (no F95 duplex with predicted hairpin structure Tm > 60 °C). In
contrast, about 60% of the duplexes in the groups having internal hairpins with
calculated Tm values less than 20 °C were F80. Thus, the stability of internal repeats
is inversely proportional to the silencing effect and defines Criterion III (predicted
hairpin structure Tm < 20 °C).

Sequence-based determinants of siRNA functionality 

When the siRNA panel was sorted into functional and non-functional groups,
the frequency of a specific nucleotide at each position in a functional siRNA duplex
was compared with that of a nonfunctional duplex in order to assess the preference for
or against a certain nucleotide. Figure 4 shows the results of these queries and the
subsequent resorting of the data set (from Figure 2). The data are separated into two
sets: those duplexes that meet the criterion (a specific nucleotide in a certain
position), grouped on the left (Selected), and those that do not, grouped on the right
(Eliminated). The duplexes are further sorted from most functional to least functional,
with the y-axis of Figure 4a-e representing the % expression, i.e. the amount of
silencing that is elicited by the duplex (Note: each position on the X-axis represents a
different duplex). Statistical analysis revealed correlations between silencing and
several sequence-related properties of siRNAs. Figure 4 and Table IV show
quantitative analysis for the following five sequence-related properties of siRNA: (A)
an A at position 19 of the sense strand; (B) an A at position 3 of the sense strand; (C)
a U at position 10 of the sense strand; (D) a base other than G at position 13 of the
sense strand; and (E) a base other than C at position 19 of the sense strand.

When the siRNAs in the panel were evaluated for the presence of an A at
position 19 of the sense strand, the percentage of non-functional duplexes decreased
from 20% to 11.8%, and the percentage of F95 duplexes increased from 21.7% to
29.4% (Table IV). Thus, the presence of an A in this position defined Criterion IV.

Another sequence-related property correlated with silencing was the presence
of an A in position 3 of the sense strand (Figure 4b). Of the siRNAs with A3, 34.4%
were F95, compared with 21.7% of randomly selected siRNAs. The presence of a U
base in position 10 of the sense strand exhibited an even greater impact (Figure 4c).
Of the duplexes in this group, 41.7% were F95. These properties became criteria V
and VI, respectively.



Two negative sequence-related criteria that were identified also appear in
Figure 4. The absence of a G at position 13 of the sense strand conferred a marginal
increase in selecting functional duplexes (Figure 4d). Similarly, lack of a C at
position 19 of the sense strand also correlated with functionality (Figure 4e). Thus,
among functional duplexes, position 19 was most likely occupied by A, and rarely
occupied by C. These rules were defined as criteria VIII and VII, respectively (Table IV).
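
The sequence-only criteria can be checked mechanically for a given sense strand. The
Python sketch below is illustrative and makes several assumptions: the function name
sequence_criteria is hypothetical, the input is a 19-base sense strand written 5' to
3', the criteria are numbered as in Table IV, and Criterion III is omitted because it
depends on a predicted hairpin Tm that this sketch does not compute.

```python
def sequence_criteria(sense):
    """Return a pass/fail flag for each criterion that needs only the sequence."""
    s = sense.upper()
    if len(s) != 19:
        raise ValueError("this sketch assumes a 19-base sense strand")
    gc_percent = 100.0 * sum(base in "GC" for base in s) / len(s)
    return {
        "I: 30-52% G/C": 30.0 <= gc_percent <= 52.0,
        "II: >=3 A/U at 15-19": sum(base in "AU" for base in s[14:19]) >= 3,
        "IV: A at 19": s[18] == "A",
        "V: A at 3": s[2] == "A",
        "VI: U at 10": s[9] == "U",
        "VII: not C at 19": s[18] != "C",
        "VIII: not G at 13": s[12] != "G",
    }


# Made-up 19-mer used only to exercise the checks.
for name, passed in sequence_criteria("GCAUCGAUCUGCAGCAUUA").items():
    print(name, passed)
```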

Application of each criterion individually provided marginal but statistically
significant increases in the probability of selecting a potent siRNA. Although the
results were informative, the inventors sought to maximize potency and therefore
considered multiple criteria or parameters. Optimization is particularly important when
developing therapeutics. Interestingly, the probability of selecting a functional siRNA
based on each thermodynamic criterion was 2% - 4% higher than random, but 4% -
8% higher for the sequence-related determinants. Presumably, these sequence-related
increases reflect the complexity of the RNAi mechanism and the multitude of
protein-RNA interactions that are involved in RNAi-mediated silencing.

Table IV

Criterion                                % Functional       Improvement
                                                            over Random

I. 30% - 52% G/C content                 < F50   16.4%      -3.6%
                                         > F50   83.6%       3.6%
                                         > F80   60.4%       4.3%
                                         > F95   23.9%       2.2%

II. At least 3 A/U bases at positions    < F50   18.2%      -1.8%
    15 - 19 of the sense strand          > F50   81.8%       1.8%
                                         > F80   59.7%       3.6%
                                         > F95   24.0%       2.3%

III. Absence of internal repeats,        < F50   16.7%      -3.3%
     as measured by Tm of                > F50   83.3%       3.3%
     secondary structure < 20 °C         > F80   61.1%       5.0%
                                         > F95   24.6%       2.9%

IV. An A base at position 19             < F50   11.8%      -8.2%
    of the sense strand                  > F50   88.2%       8.2%
                                         > F80   75.0%      18.9%
                                         > F95   29.4%       7.7%

V. An A base at position 3               < F50   17.2%      -2.8%
   of the sense strand                   > F50   82.8%       2.8%
                                         > F80   62.5%       6.4%
                                         > F95   34.4%      12.7%

VI. A U base at position 10              < F50   13.9%      -6.1%
    of the sense strand                  > F50   86.1%       6.1%
                                         > F80   69.4%      13.3%
                                         > F95   41.7%      20%

VII. A base other than C at              < F50   18.8%      -1.2%
     position 19 of the sense strand     > F50   81.2%       1.2%
                                         > F80   59.7%       3.6%
                                         > F95   24.2%       2.5%

VIII. A base other than G at             < F50   15.2%      -4.8%
      position 13 of the sense strand    > F50   84.8%       4.8%
                                         > F80   61.4%       5.3%
                                         > F95   26.5%       4.8%

The siRNA selection algorithm 

In an effort to improve selection further, all identified criteria, including but
not limited to those listed in Table IV, were combined into the algorithms embodied in
Formula VIII and Formula IX. Each siRNA was then assigned a score (referred to as
a SMARTscore™) according to the values derived from the formulas. Duplexes that
scored higher than 0 or 20, for Formulas VIII and IX, respectively, effectively
selected a set of functional siRNAs and excluded all non-functional siRNAs.
Conversely, all duplexes scoring lower than 0 and 20 according to Formulas VIII and
IX, respectively, contained some functional siRNAs but included all non-functional
siRNAs. A graphical representation of this selection is shown in Figure 5.

The methods for obtaining the eight criteria embodied in Table IV are
illustrative of the results of the process used to develop the information for Formulas
VIII and IX. Thus similar techniques were used to establish the other variables and
their multipliers. As described above, basic statistical methods were used to determine
the relative values for these multipliers.

To determine the value for "Improvement over Random," the difference in the
frequency of a given attribute (e.g. GC content, base preference) at a particular
position is determined between individual functional groups (e.g. <F50) and the total
siRNA population studied (e.g. 270 siRNA molecules selected randomly). Thus, for
instance, in Criterion I (30%-52% GC content) members of the <F50 group were
observed to have GC contents between 30-52% in 16.4% of the cases. In contrast, the
total group of 270 siRNAs had GC contents in this range 20% of the time. Thus for
this particular attribute, there is a small negative correlation between 30%-52% GC
content and this functional group (i.e. 16.4% - 20% = -3.6%). Similarly, for Criterion
VI (a "U" at position 10 of the sense strand), the >F95 group contained a "U" at this
position 41.7% of the time. In contrast, the total group of 270 siRNAs had a "U" at
this position 21.7% of the time; thus the improvement over random is calculated to be
20% (or 41.7% - 21.7%).

Identifying The Average Internal Stability Profile of Strong siRNA 
In order to identify an internal stability profile that is characteristic of strong
siRNA, 270 different siRNAs derived from the cyclophilin B, the diazepam binding
inhibitor (DBI), and the luciferase genes were individually transfected into HEK293
cells and tested for their ability to induce RNAi of the respective gene. Based on their
performance in the in vivo assay, the sequences were then subdivided into three
groups: (i) >95% silencing; (ii) 80-95% silencing; and (iii) less than 50% silencing.
Sequences exhibiting 51-84% silencing were eliminated from further consideration to
reduce the difficulties in identifying relevant thermodynamic patterns.

Following the division of siRNA into three groups, a statistical analysis was
performed on each member of each group to determine the average internal stability
profile (AISP) of the siRNA. To accomplish this, the Oligo 5.0 Primer Analysis
Software and other related statistical packages (e.g. Excel) were used to
determine the internal stability of pentamers using the nearest neighbor method
described by Freier et al., (1986) Improved free-energy parameters for predictions of
RNA duplex stability, Proc. Natl. Acad. Sci. U.S.A. 83(24): 9373-7. Values for each
group at each position were then averaged, and the resulting data were graphed on a
linear coordinate system with the Y-axis expressing the ΔG (free energy) values in
kcal/mole and the X-axis identifying the position of the base relative to the 5' end.
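
The averaging step can be sketched as follows. This Python example covers only the
per-position averaging across a group; it assumes the per-position ΔG values for each
siRNA have already been obtained (for instance from a nearest neighbor calculation),
the function name average_profile is hypothetical, and the numbers in the example are
made up for illustration rather than taken from the data set.

```python
def average_profile(profiles):
    """Average per-position stability values (kcal/mol) across a group of siRNAs.

    Each profile is a list of values, one per position, in the same order.
    """
    if not profiles:
        raise ValueError("need at least one profile")
    length = len(profiles[0])
    if any(len(profile) != length for profile in profiles):
        raise ValueError("all profiles must cover the same positions")
    return [sum(column) / len(profiles) for column in zip(*profiles)]


# Two made-up, two-position profiles.
print(average_profile([[-8.0, -7.0], [-9.0, -7.5]]))
```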



The results of the analysis identified multiple key regions in siRNA molecules
that were critical for successful gene silencing. At the 3'-most end of the sense strand
(5' antisense), highly functional siRNA (>95% gene silencing, see Figure 6a, >F95)
have a low internal stability (AISP of position 19 = ~ -7.6 kcal/mol). In contrast,
low-efficiency siRNA (i.e. those exhibiting less than 50% silencing, <F50) display a
distinctly different profile, having high ΔG values (~ -8.4 kcal/mol) for the same
position. Moving in a 5' (sense strand) direction, the internal stability of highly
efficient siRNA rises (position 12 = ~ -8.3 kcal/mol) and then drops again (position 7
= ~ -7.7 kcal/mol) before leveling off at a value of approximately -8.1 kcal/mol for the
5' terminus. siRNA with poor silencing capabilities show a distinctly different
profile. While the AISP value at position 12 is nearly identical with that of strong
siRNAs, the values at positions 7 and 8 rise considerably, peaking at a high of ~ -9.0
kcal/mol. In addition, at the 5' end of the molecule the AISP profiles of strong and
weak siRNA differ dramatically. Unlike the relatively strong values exhibited by
siRNA in the >95% silencing group, siRNAs that exhibit poor silencing activity have
weak AISP values (-7.6, -7.5, and -7.5 kcal/mol for positions 1, 2 and 3 respectively).

Overall the profiles of both strong and weak siRNAs form distinct sinusoidal
shapes that are roughly 180° out-of-phase with each other. While these
thermodynamic descriptions define the archetypal profile of a strong siRNA, it will
likely be the case that neither the ΔG values given for key positions in the profile nor
the absolute position of the profile along the Y-axis (i.e. the ΔG-axis) are absolutes.
Profiles that are shifted upward or downward (i.e. having on average higher or
lower values at every position) but retain the relative shape and position of the profile
along the X-axis can be foreseen as being equally effective as the model profile
described here. Moreover, it is likely that siRNA that have strong or even stronger
gene-specific silencing effects might have exaggerated ΔG values (either higher or
lower) at key positions. Thus, for instance, it is possible that the 5'-most position of
the sense strand (position 19) could have ΔG values of 7.4 kcal/mol or lower and still
be a strong siRNA if, for instance, a G-C -> G-T/U mismatch were substituted at
position 19 and altered duplex stability. Similarly, position 12 and position 7 could
have values above 8.3 kcal/mol and below 7.7 kcal/mol, respectively, without
abating the silencing effectiveness of the molecule. Thus, for instance, at position 12,
a stabilizing chemical modification (e.g. a chemical modification of the 2' position of
the sugar backbone) could be added that increases the average internal stability at that
position. Similarly, at position 7, mismatches similar to those described previously
could be introduced that would lower the ΔG values at that position.

Lastly, it is important to note that while functional and non-functional siRNA
were originally defined as those molecules having specific silencing properties, both
broader or more limiting parameters can be used to define these molecules. As used
herein, unless otherwise specified, "non-functional siRNA" are defined as those
siRNA that induce less than 50% (<50%) target silencing, "semi-functional siRNA"
induce 50-79% target silencing, "functional siRNA" are molecules that induce
80-95% gene silencing, and "highly-functional siRNA" are molecules that induce greater
than 95% gene silencing. These definitions are not intended to be rigid and can vary
depending upon the design and needs of the application. For instance, it is possible
that a researcher attempting to map a gene to a chromosome using a functional assay
may identify an siRNA that reduces gene activity by only 30%. While this level of
gene silencing may be "non-functional" for e.g. therapeutic needs, it is sufficient for
gene mapping purposes and is, under these uses and conditions, "functional." For
these reasons, functional siRNA can be defined as those molecules having greater
than 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% silencing capabilities at
100 nM transfection conditions. Similarly, depending upon the needs of the study
and/or application, non-functional and semi-functional siRNA can be defined as
having different parameters. For instance, semi-functional siRNA can be defined as
being those molecules that induce 20%, 30%, 40%, 50%, 60%, or 70% silencing at
100 nM transfection conditions. Similarly, non-functional siRNA can be defined as
being those molecules that silence gene expression by less than 70%, 60%, 50%,
40%, 30%, or less. Nonetheless, unless otherwise stated, the descriptions stated in
the "Definitions" section of this text should be applied.
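
The default definitions given above map a measured percent silencing to a class. A
short Python sketch of that mapping follows. The function name classify_silencing is
hypothetical, and the handling of values that fall between the stated integer bounds
(for example 79.5%) is an assumption of this sketch.

```python
def classify_silencing(percent_silencing):
    """Map percent target silencing to the default functional class."""
    if not 0 <= percent_silencing <= 100:
        raise ValueError("percent silencing must be between 0 and 100")
    if percent_silencing < 50:
        return "non-functional"
    if percent_silencing < 80:
        return "semi-functional"
    if percent_silencing <= 95:
        return "functional"
    return "highly-functional"


for value in (30, 50, 79, 80, 95, 96):
    print(value, classify_silencing(value))
```

Because the text allows the cutoffs to vary with the application, a fuller version
might accept the thresholds as parameters instead of fixing them.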

Functional attributes can be assigned to each of the key positions in the AISP
of strong siRNA. The low 5' (sense strand) AISP values of strong siRNAs may be
necessary for determining which end of the molecule enters the RISC complex. In
contrast, the high and low AISP values observed in the central regions of the molecule
may be critical for siRNA-target mRNA interactions and product release,
respectively.

If the AISP values described above accurately define the thermodynamic
parameters of strong siRNA, it would be expected that similar patterns would be
observed in strong siRNA isolated from nature. Natural siRNAs exist in a harsh,
RNase-rich environment and it can be hypothesized that only those siRNA that
exhibit heightened affinity for RISC (i.e. siRNA that exhibit an average internal
stability profile similar to those observed in strong siRNA) would survive in an
intracellular environment. This hypothesis was tested using GFP-specific siRNA
isolated from N. benthamiana. Llave et al. (2002) Endogenous and Silencing-Associated
Small RNAs in Plants, Plant Cell 14, 1605-1619, introduced long
double-stranded GFP-encoding RNA into plants and subsequently re-isolated
GFP-specific siRNA from the tissues. The AISPs of fifty-nine of these GFP-siRNA were
determined, averaged, and subsequently plotted alongside the AISP profile obtained
from the cyclophilin B/DBI/luciferase siRNA having >90% silencing properties
(Figure 6b). Comparison of the two groups shows that the profiles are nearly identical.
This finding validates the information provided by the internal stability profiles and
demonstrates that: (1) the profile identified by analysis of the cyclophilin B/DBI/
luciferase siRNAs is not gene specific; and (2) AISP values can be used to search for
strong siRNAs in a variety of species.

Both chemical modifications and base-pair mismatches can be incorporated
into siRNA to alter the duplex's AISP and functionality. For instance, introduction of
mismatches at positions 1 or 2 of the sense strand destabilizes the 5' end of the sense
strand and increases the functionality of the molecule (see Luc, Figure 7). Similarly,
addition of 2'-O-methyl groups to positions 1 and 2 of the sense strand can also alter
the AISP and (as a result) both increase the functionality of the molecule and
eliminate off-target effects that result from sense strand homology with unrelated
targets (Figures 8a, 8b).

Rationale for Criteria in a Biological Context

The fate of siRNA in the RNAi pathway may be described in 5 major steps:
(1) duplex recognition and pre-RISC complex formation; (2) ATP-dependent duplex
unwinding/strand selection and RISC activation; (3) mRNA target identification; (4)
mRNA cleavage; and (5) product release (Figure 1). Given the level of nucleic
acid-protein interactions at each step, siRNA functionality is likely influenced by specific
biophysical and molecular properties that promote efficient interactions within the
context of the multi-component complexes. Indeed, the systematic analysis of the
siRNA test set identified multiple factors that correlate well with functionality. When
combined into a single algorithm, they proved to be very effective in selecting active
siRNAs.

The factors described here may also be predictive of key functional
associations important for each step in RNAi. For example, the potential formation of
internal hairpin structures correlated negatively with siRNA functionality.
Complementary strands with stable internal repeats are more likely to exist as stable
hairpins, thus decreasing the effective concentration of the functional duplex form.
This suggests that the duplex is the preferred conformation for initial pre-RISC
association. Indeed, although single complementary strands can induce gene
silencing, the effective concentration required is at least two orders of magnitude
higher than that of the duplex form.
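
The hairpin criterion can be illustrated with a small screening sketch. The code below is not the scoring used in this disclosure; it is a minimal brute-force check, under the assumption that only Watson-Crick pairs (A-U, G-C) count and that a loop needs at least three unpaired bases. The function names and the `min_loop` parameter are hypothetical.

```python
"""Sketch: estimate whether a single siRNA strand could fold into a hairpin."""

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA string (Watson-Crick only)."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def longest_hairpin_stem(strand: str, min_loop: int = 3) -> int:
    """Length of the longest perfectly paired stem the strand can form with itself.

    A stem of length k exists when some segment strand[i:i+k] has its reverse
    complement further along the strand, separated by at least `min_loop`
    unpaired bases. G-U wobble pairs and bulges are ignored (an assumption).
    """
    n = len(strand)
    max_stem = (n - min_loop) // 2
    for k in range(max_stem, 0, -1):          # try the longest stems first
        for i in range(0, n - 2 * k - min_loop + 1):
            partner = reverse_complement(strand[i:i + k])
            # the partner segment must start after the stem plus the loop
            if strand.find(partner, i + k + min_loop) != -1:
                return k
    return 0


if __name__ == "__main__":
    for seq in ("GGGGAAAACCCC", "GUUCCAAAAACAGUGGAUA"):
        print(seq, longest_hairpin_stem(seq))
```

A strand with a long reported stem would be a candidate for exclusion; what counts as "long" is left to the user, since no cutoff is stated here.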

siRNA-pre-RISC complex formation is followed by an ATP-dependent duplex
unwinding step and "activation" of the RISC. The siRNA functionality was shown to
correlate with overall low internal stability of the duplex and low internal stability of
the 3' sense end (or differential internal stability of the 3' sense end compared to the 5'
sense end), which may reflect strand selection and entry into the RISC. Overall
duplex stability and low internal stability at the 3' end of the sense strand were also
correlated with siRNA functionality. Interestingly, siRNAs with very high and very
low overall stability profiles correlate strongly with non-functional duplexes. One
interpretation is that high internal stability prevents efficient unwinding while very
low stability reduces siRNA target affinity and subsequent mRNA cleavage by the
RISC.



Several criteria describe base preferences at specific positions of the sense
strand and are even more intriguing when considering their potential mechanistic
roles in target recognition and mRNA cleavage. Base preferences for A at position 19
of the sense strand, but not C, are particularly interesting because they reflect the same
base preferences observed for naturally occurring miRNA precursors. That is, among
the reported miRNA precursor sequences 75% contain a U at position 1, which
corresponds to an A in position 19 of the sense strand of siRNAs, while G was
under-represented in this same position for miRNA precursors. These observations support
the hypothesis that both miRNA precursors and siRNA duplexes are processed by
very similar if not identical protein machinery. The functional interpretation of the
predominance of a U/A base pair is that it promotes flexibility at the 5' antisense ends
of both siRNA duplexes and miRNA precursors and facilitates efficient unwinding
and selective strand entrance into an activated RISC.

Among the criteria associated with base preferences that are likely to
influence mRNA cleavage or possibly product release, the preference for U at
position 10 of the sense strand exhibited the greatest impact, enhancing the
probability of selecting an F80 sequence by 13.3%. Activated RISC preferentially
cleaves target mRNA between nucleotides 10 and 11 relative to the 5' end of the
complementary targeting strand. Therefore, it may be that U, the preferred base for
most endoribonucleases, at this position supports more efficient cleavage.
Alternatively, a U/A bp between the targeting siRNA strand and its cognate target
mRNA may create an optimal conformation for the RISC-associated "slicing"
activity.
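
The positional base preferences discussed above (A but not C at sense position 19, U at sense position 10) can be checked mechanically. The following is only a sketch: the function name and the returned keys are hypothetical, positions are 1-based on a 19-nucleotide sense strand, and no weights are assigned because the scoring formulas are not restated in this passage.

```python
"""Sketch: report the base-preference flags discussed for a 19-mer sense strand."""

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def base_preference_flags(sense: str) -> dict:
    """Return simple yes/no flags for a 19-nt sense strand (5' to 3')."""
    if len(sense) != 19:
        raise ValueError("expected a 19-nucleotide sense strand")
    pos10 = sense[9]    # 1-based position 10
    pos19 = sense[18]   # 1-based position 19
    return {
        "a_at_19": pos19 == "A",          # favoured
        "c_at_19": pos19 == "C",          # disfavoured
        "u_at_10": pos10 == "U",          # favoured, near the cleavage site
        # sense position 19 pairs with antisense position 1 (its 5' end)
        "antisense_5prime_base": COMPLEMENT[pos19],
    }


if __name__ == "__main__":
    # First cyclophilin sequence listed in Table III, used only as sample input.
    print(base_preference_flags("GUUCCAAAAACAGUGGAUA"))
```

For the sample input, the strand ends in A, so the antisense strand begins with U, which is the U/A pair the text associates with flexibility at the 5' antisense end.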

According to another embodiment, the present invention provides a pool of at
least two siRNAs, preferably in the form of a kit or therapeutic reagent, wherein one
strand of each of the siRNAs, the sense strand, comprises a sequence that is
substantially similar to a sequence within a target mRNA. The opposite strand, the
antisense strand, will preferably comprise a sequence that is substantially
complementary to that of the target mRNA. More preferably, one strand of each
siRNA will comprise a sequence that is identical to a sequence that is contained in the
target mRNA. Most preferably, each siRNA will be 19 base pairs in length, and one
strand of each of the siRNAs will be 100% complementary to a portion of the target
mRNA.

By increasing the number of siRNAs directed to a particular target using a
pool or kit, one is able both to increase the likelihood that at least one siRNA with
satisfactory functionality will be included and to benefit from additive or
synergistic effects. Further, when two or more siRNAs directed against a single gene
do not have satisfactory levels of functionality alone, if combined, they may
satisfactorily promote degradation of the target messenger RNA and successfully
inhibit translation. By including multiple siRNAs in the system, not only is the
probability of silencing increased, but the economics of operation are also improved
when compared to adding different siRNAs sequentially. This effect is contrary to the
conventional wisdom that the concurrent use of multiple siRNA will negatively
impact gene silencing (e.g. Holen, T. et al. (2003) "Similar behavior of single strand
and double strand siRNAs suggests they act through a common RNAi pathway."
NAR 31: 2401-2407).

In fact, when two siRNAs were pooled together, 54% of the pools of two
siRNAs induced more than 95% gene silencing. Thus, a 2.5-fold increase in the
percentage of functionality was achieved by randomly combining two siRNAs.
Further, over 84% of pools containing two siRNAs induced more than 80% gene
silencing.

More preferably, the kit is comprised of at least three siRNAs, wherein one
strand of each siRNA comprises a sequence that is substantially similar to a sequence
of the target mRNA and the other strand comprises a sequence that is substantially
complementary to the region of the target mRNA. As with the kit that comprises at
least two siRNAs, more preferably one strand will comprise a sequence that is
identical to a sequence that is contained in the mRNA and another strand that is 100%
complementary to a sequence that is contained in the mRNA. During experiments,
when three siRNAs were combined together, 60% of the pools induced more than
95% gene silencing and 92% of the pools induced more than 80% gene silencing.

Further, even more preferably, the kit is comprised of at least four siRNAs,
wherein one strand of each siRNA comprises a sequence that is substantially similar
to a region of the sequence of the target mRNA, and the other strand comprises a
sequence that is substantially complementary to the region of the target mRNA. As
with the kit or pool that comprises at least two siRNAs, more preferably one strand of
each of the siRNA duplexes will comprise a sequence that is identical to a sequence
that is contained in the mRNA, and another strand that is 100% complementary to a
sequence that is contained in the mRNA.



Additionally, kits and pools with at least five, at least six, and at least seven
siRNAs may also be useful with the present invention. For example, pools of five
siRNA induced 95% gene silencing with 77% probability and 80% silencing with
98.8% probability. Thus, pooling of siRNAs together can result in the creation of a
target-specific silencing reagent with almost a 99% probability of being functional.
The fact that such high levels of success are achievable using such pools of siRNA
enables one to dispense with costly and time-consuming target-specific validation
procedures.
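
The pooling figures quoted above can be set against a simple baseline. The sketch below is an illustration and not an analysis from this disclosure: it infers a single-siRNA success rate from the stated 2.5-fold increase (54% / 2.5), and then assumes, hypothetically, that a pool succeeds only if at least one member would have succeeded alone and that members are independent. Pools that beat this baseline would be consistent with the additive or synergistic effects described.

```python
"""Sketch: compare reported pool success rates with an independence baseline."""

# Reported fraction of pools giving more than 95% silencing, keyed by pool size.
OBSERVED_F95 = {2: 0.54, 3: 0.60, 5: 0.77}

# Inferred, not stated directly: 54% is described as a 2.5-fold increase.
SINGLE_F95 = OBSERVED_F95[2] / 2.5


def independent_pool_rate(single_rate: float, pool_size: int) -> float:
    """P(at least one member succeeds) if members act independently (assumption)."""
    return 1.0 - (1.0 - single_rate) ** pool_size


if __name__ == "__main__":
    print(f"inferred single-siRNA rate: {SINGLE_F95:.3f}")
    for size, observed in sorted(OBSERVED_F95.items()):
        baseline = independent_pool_rate(SINGLE_F95, size)
        print(f"pool of {size}: observed {observed:.2f}, baseline {baseline:.3f}")
```

Under these assumptions each reported pool rate lies above the independence baseline, but the comparison is only as good as the inferred single-siRNA rate.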

For this embodiment, as well as the other aforementioned embodiments, each
of the siRNAs within a pool will preferably comprise between 18 and 30 base pairs,
more preferably between 18 and 25 base pairs, and most preferably 19 base pairs.
Within each siRNA, preferably at least 18 contiguous bases of the antisense strand
will be 100% complementary to the target mRNA. More preferably, at least 19
contiguous bases of the antisense strand will be 100% complementary to the target
mRNA. Additionally, there may be overhangs on either the sense strand or the
antisense strand, and these overhangs may be at either the 5' end or the 3' end of
either of the strands; for example, there may be one or more overhangs of 1-6 bases.
When overhangs are present, they are not included in the calculation of the number of
base pairs. The two-nucleotide 3' overhangs mimic natural siRNAs and are
commonly used but are not essential. Preferably, the overhangs should consist of two
nucleotides, most often dTdT or UU, at the 3' end of the sense and antisense strands,
that are not complementary to the target sequence. The siRNAs may be produced by
any method that is now known or that comes to be known for synthesizing double
stranded RNA that one skilled in the art would appreciate would be useful in the
present invention. Preferably, the siRNAs will be produced by Dharmacon's
proprietary ACE® technology. However, other methods for synthesizing siRNAs are
well known to persons skilled in the art and include, but are not limited to, any
chemical synthesis of RNA oligonucleotides, ligation of shorter oligonucleotides, in
vitro transcription of RNA oligonucleotides, the use of vectors for expression within
cells, recombinant Dicer products and PCR products.

The siRNA duplexes within the aforementioned pools of siRNAs may
correspond to overlapping sequences within a particular mRNA, or non-overlapping
sequences of the mRNA. However, preferably they correspond to non-overlapping
sequences. Further, each siRNA may be selected randomly, or one or more of the
siRNA may be selected according to the criteria discussed above for maximizing the
effectiveness of siRNA.
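
The structural preferences for a pool (19-base-pair duplexes, a sense strand identical to a stretch of the target mRNA, a fully complementary antisense strand, and non-overlapping target sites) can be verified with a few lines of code. This is a minimal sketch with hypothetical function names; the "mRNA" in the usage example is a toy string stitched together from two Table III sequences and is not a real transcript.

```python
"""Sketch: check that a pool of sense strands maps to non-overlapping target sites."""

from typing import List

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Antisense strand (5' to 3') that is 100% complementary to `rna`."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def locate_pool(mrna: str, senses: List[str], length: int = 19) -> List[int]:
    """Return the 0-based start of each sense strand within the mRNA."""
    starts = []
    for sense in senses:
        if len(sense) != length:
            raise ValueError(f"{sense} is not {length} nt long")
        start = mrna.find(sense)   # first occurrence only
        if start == -1:
            raise ValueError(f"{sense} is not identical to any stretch of the mRNA")
        starts.append(start)
    return starts


def has_overlap(starts: List[int], length: int = 19) -> bool:
    """True if any two target sites share at least one nucleotide."""
    ordered = sorted(starts)
    return any(b - a < length for a, b in zip(ordered, ordered[1:]))


if __name__ == "__main__":
    pool = ["GUUCCAAAAACAGUGGAUA", "AGCAAAUUCCAUCGUGUAA"]
    toy_mrna = "".join(pool)       # illustrative target, not a real mRNA
    sites = locate_pool(toy_mrna, pool)
    print(sites, "overlap:", has_overlap(sites))
    print([reverse_complement(s) for s in pool])
```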

Included in the definition of siRNAs are siRNAs that contain substituted
and/or labeled nucleotides that may, for example, be labeled by radioactivity,
fluorescence or mass. The most common substitutions are at the 2' position of the
ribose sugar, where moieties such as H (hydrogen), F, NH3, OCH3 and other O-alkyl,
alkenyl, alkynyl, and orthoesters may be substituted, or in the phosphorus backbone,
where sulfur, amines or hydrocarbons may be substituted for the bridging or
non-bridging atoms in the phosphodiester bond. Examples of modified siRNAs are
explained more fully in commonly assigned U.S. Patent Application Ser. No.
10/613,077, filed July 1, 2003, which is incorporated by reference herein.

Additionally, as noted above, the cell type into which the siRNA is introduced
may affect the ability of the siRNA to enter the cell; however, it does not appear to
affect the ability of the siRNA to function once it enters the cell. Methods for
introducing double-stranded RNA into various cell types are well known to persons
skilled in the art.



As persons skilled in the art are aware, in certain species, the presence of
proteins such as RdRP, the RNA-dependent RNA polymerase, may catalytically
enhance the activity of the siRNA. For example, RdRP propagates the RNAi effect in
C. elegans and other non-mammalian organisms. In fact, in organisms that contain
these proteins, the siRNA may be inherited. Two other proteins that are well studied
and known to be a part of the machinery are members of the Argonaute family and
Dicer, as well as their homologues. There is also initial evidence that the RISC
complex might be associated with the ribosome so the more efficiently translated
mRNAs will be more susceptible to silencing than others.

Another very important factor in the efficacy of siRNA is mRNA localization.
In general, only cytoplasmic mRNAs are considered to be accessible to RNAi to any
appreciable degree. However, appropriately designed siRNAs, for example, siRNAs
modified with internucleotide linkages, may be able to cause silencing by acting in the
nucleus. Examples of these types of modifications are described in commonly
assigned U.S. Patent Application Serial Nos. 10/431,027 and 10/613,077, each of
which is incorporated by reference herein.

As described above, even when one selects at least two siRNAs at random, the
effectiveness of the two may be greater than one would predict based on the
effectiveness of two individual siRNAs. This additive or synergistic effect is
particularly noticeable as one increases to at least three siRNAs, and even more
noticeable as one moves to at least four siRNAs. Surprisingly, the pooling of
non-functional and semi-functional siRNAs, particularly more than five siRNAs, can lead
to a silencing mixture that is as effective if not more effective than any one particular
functional siRNA.

Within the kit of the present invention, preferably each siRNA will be present
in a concentration of between 0.001 and 200 µM, more preferably between 0.01 and
200 nM, and most preferably between 0.1 and 10 nM.

In addition to preferably comprising at least four or five siRNAs, the kit of the
present invention will also preferably comprise a buffer to keep the siRNA duplex
stable. Persons skilled in the art are aware of buffers suitable for keeping siRNA
stable. For example, the buffer may be comprised of 100 mM KCl, 30 mM HEPES
pH 7.5, and 1 mM MgCl2. Alternatively, kits might contain complementary strands
that contain any one of a number of chemical modifications (e.g. a 2'-OACE) that
protect the agents from degradation by nucleases. In this instance, the user may (or
may not) remove the modifying protective group (e.g. deprotect) before annealing the
two complementary strands together.

By way of example, the kit may be organized such that pools of siRNA
duplexes are provided on an array or microarray of wells or drops for a particular
gene set or for unrelated genes. The array may, for example, be in 96 wells, 384 wells
or 1284 wells arrayed in a plastic plate or on a glass slide using techniques now
known or that come to be known to persons skilled in the art. Within an array,
preferably there will be controls such as functional anti-lamin A/C, cyclophilin and
two siRNA duplexes that are not specific to the gene of interest.

In order to ensure stability of the siRNA pools prior to usage, they may be
retained in lyophilized form at minus twenty degrees (-20°C) until they are ready for
use. Prior to usage, they should be resuspended; however, even once resuspended, for
example, in the aforementioned buffer, they should be kept at minus twenty degrees
(-20°C) until used. The aforementioned buffer, prior to use, may be stored at
approximately 4°C or room temperature. Effective temperatures at which to conduct
transfections are well known to persons skilled in the art and include, for example,
room temperature.

The kit may be applied either in vivo or in vitro. Preferably, the siRNA of the
pools or kits is applied to a cell through transfection, employing standard transfection
protocols. These methods are well known to persons skilled in the art and include the
use of lipid-based carriers, electroporation, cationic carriers, and microinjection.
Further, one could apply the present invention by synthesizing equivalent DNA
sequences (either as two separate, complementary strands, or as hairpin molecules)
instead of siRNA sequences and introducing them into cells through vectors. Once in
the cells, the cloned DNA could be transcribed, thereby forcing the cells to generate
the siRNA. Examples of vectors suitable for use with the present application include
but are not limited to the standard transient expression vectors, adenoviruses,
retroviruses, lentivirus-based vectors, as well as other traditional expression vectors.
Any vector that has an adequate siRNA expression and processing module may be
used. Furthermore, certain chemical modifications to siRNAs, including but not
limited to conjugations to other molecules, may be used to facilitate delivery. For
certain applications it may be preferable to deliver molecules without transfection by
simply formulating in a physiologically acceptable solution.

This embodiment may be used in connection with any of the aforementioned
embodiments. Accordingly, the sequences within any pool may be selected by
rational design.

Multigene Silencing 

In addition to developing kits that contain multiple siRNA directed against a
single gene, another embodiment includes the use of multiple siRNA targeting
multiple genes. Multiple genes may be targeted through the use of high- or
hyper-functional siRNA. High- or hyper-functional siRNA that exhibit increased potency
require lower concentrations to induce desired phenotypic (and thus therapeutic)
effects. This circumvents RISC saturation. It therefore stands to reason that if lower
concentrations of a single siRNA are needed to knock out or knock down expression
of one gene, then the remaining (uncomplexed) RISC will be free and available to
interact with siRNA directed against two, three, four, or more genes. Thus in this
embodiment, the authors describe the use of highly functional or hyper-functional
siRNA to knock out three separate genes. More preferably, such reagents could be
combined to knock out four distinct genes. Even more preferably, highly functional or
hyperfunctional siRNA could be used to knock out five distinct genes. Most
preferably, siRNA of this type could be used to knock out or knock down the
expression of six or more genes.



Hyperfunctional siRNA 

The term hyperfunctional siRNA (hf-siRNA) describes a subset of the siRNA
population that induces RNAi in cells at low- or sub-nanomolar concentrations for
extended periods of time. These traits, heightened potency and extended longevity of
the RNAi phenotype, are highly attractive from a therapeutic standpoint. Agents
having higher potency require lesser amounts of the molecule to achieve the desired
physiological response, thus reducing the probability of side effects due to
"off-target" interference. In addition to the potential therapeutic benefits associated with
hyperfunctional siRNA, hf-siRNA are also desirable from an economic position.
Hyperfunctional siRNA may cost less on a per-treatment basis, thus reducing the
overall expenditures to both the manufacturer and the consumer.

Identification of hyperfunctional siRNA involves multiple steps that are
designed to examine an individual siRNA agent's concentration and/or longevity
profiles. In one non-limiting example, a population of siRNA directed against a single
gene is first analyzed using the previously described algorithm (Formula VIII).
Individual siRNA are then introduced into a test cell line and assessed for the ability
to degrade the target mRNA. It is important to note that when performing this step it
is not necessary to test all of the siRNA. Instead, it is sufficient to test only those
siRNA having the highest SMARTscores™ (i.e. SMARTscore™ > -10).
Subsequently, the gene silencing data are plotted against the SMARTscores™ (see
Figure 9). siRNA that (1) induce a high degree of gene silencing (i.e. they induce
greater than 80% gene knockdown) and (2) have superior SMARTscores™ (i.e. a
SMARTscore™ of > -10, suggesting a desirable average internal stability profile) are
selected for further investigations designed to better understand the molecule's
potency and longevity. In one non-limiting study dedicated to understanding a
molecule's potency, an siRNA is introduced into one (or more) cell types in
progressively lower concentrations (e.g. 3.0 to 0.3 nM). Subsequently, the level
of gene silencing induced by each concentration is examined and siRNA that exhibit
hyperfunctional potency (i.e. those that induce 80% silencing or greater at e.g.
picomolar concentrations) are identified. In a second study, the longevity profiles of
siRNA having high (> -10) SMARTscores™ and greater than 80% silencing are
examined. In one non-limiting example of how this is achieved, siRNA are introduced
into a test cell line and the levels of RNAi are measured over an extended period of
time (e.g. 24-168 hrs). siRNAs that exhibit strong RNA interference patterns (i.e.
>80% interference) for periods of time greater than, e.g., 120 hours are thus
identified. Studies similar to those described above can be performed on any and all
of the >10^6 siRNA included in this document to further define the most functional
molecule for any given gene. Molecules possessing one or both properties (extended
longevity and heightened potency) are labeled "hyperfunctional siRNA," and
earmarked as candidates for future therapeutic studies.
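
The screening sequence just described can be summarized as a small filter. The sketch below is an assumption-laden illustration, not the procedure as practiced: the `Candidate` record, its field names, and the 1 nM potency cutoff are hypothetical, while the SMARTscore (> -10), silencing (> 80%) and longevity (> 120 hours) thresholds are the example values quoted in the text. Scores are supplied as inputs because Formula VIII is not restated here.

```python
"""Sketch: select hyperfunctional candidates from screening data."""

from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Candidate:
    name: str
    smartscore: float                     # precomputed score (input, not derived here)
    silencing: float                      # % knockdown in the initial screen
    dose_response: Dict[float, float] = field(default_factory=dict)  # nM -> % silencing
    time_course: Dict[float, float] = field(default_factory=dict)    # hours -> % silencing


def passes_initial_screen(c: Candidate, min_score: float = -10.0,
                          min_silencing: float = 80.0) -> bool:
    return c.smartscore > min_score and c.silencing > min_silencing


def is_potent(c: Candidate, max_conc_nm: float = 1.0,
              min_silencing: float = 80.0) -> bool:
    """At least 80% silencing at some concentration at or below the cutoff."""
    return any(conc <= max_conc_nm and pct >= min_silencing
               for conc, pct in c.dose_response.items())


def silencing_duration(c: Candidate, min_silencing: float = 80.0) -> float:
    """Latest measured time point up to which silencing stayed above the threshold."""
    duration = 0.0
    for hours in sorted(c.time_course):
        if c.time_course[hours] <= min_silencing:
            break
        duration = hours
    return duration


def select_hyperfunctional(candidates: List[Candidate],
                           min_hours: float = 120.0) -> List[str]:
    """Names of candidates passing the screen with potency and/or longevity."""
    return [c.name for c in candidates
            if passes_initial_screen(c)
            and (is_potent(c) or silencing_duration(c) > min_hours)]
```

Because the text allows "one or both properties," the final filter uses a logical OR; all cutoffs are exposed as parameters since the selection parameters are described as adjustable.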

While the example(s) given above describe one means by which
hyperfunctional siRNA can be isolated, neither the assays themselves nor the
selection parameters used are rigid; both can vary with each family of siRNA. Families
of siRNA include siRNAs directed against a single gene, or directed against a related
family of genes.



The highest quality siRNA achievable for any given gene may vary
considerably. Thus, for example, in the case of one gene (gene X), rigorous studies
such as those described above may enable the identification of an siRNA that, at
picomolar concentrations, induces 99+% silencing for a period of 10 days. Yet
identical studies of a second gene (gene Y) may yield an siRNA that at high
nanomolar concentrations (e.g. 100 nM) induces only 75% silencing, for a period of 2
days. Both molecules represent the very optimum siRNA for their respective gene
targets and therefore are designated "hyperfunctional." Yet due to a variety of factors
including but not limited to target concentration, siRNA stability, cell type, off-target
interference, and others, equivalent levels of potency and longevity are not
achievable. Thus, for these reasons, the parameters described in the aforementioned
assays can vary. While the initial screen selected siRNA that had SMARTscores™
above -10 and a gene silencing capability of greater than 80%, selections that have
stronger (or weaker) parameters can be implemented. Similarly, in the subsequent
studies designed to identify molecules with high potency and longevity, the desired
cutoff criteria (i.e. the lowest concentration that induces a desirable level of
interference, or the longest period of time that interference can be observed) can vary.

The experimentation subsequent to application of the rational criteria of this
application is significantly reduced where one is trying to obtain a suitable
hyperfunctional siRNA for, for example, therapeutic use. When, for example, the
additional experimentation of the type described herein is applied by one skilled in the
art with this disclosure in hand, a hyperfunctional siRNA is readily identified.



The siRNA may be introduced into a cell by any method that is now known or
that comes to be known and that from reading this disclosure, persons skilled in the
art would determine would be useful in connection with the present invention in
enabling siRNA to cross the cellular membrane. These methods include, but are not
limited to, any manner of transfection, such as for example transfection employing
DEAE-Dextran, calcium phosphate, cationic lipids/liposomes, micelles, manipulation
of pressure, microinjection, electroporation, immunoporation, use of vectors such as
viruses, plasmids, cosmids, bacteriophages, cell fusions, and coupling of the
polynucleotides to specific conjugates or ligands such as antibodies, antigens, or
receptors, passive introduction, adding moieties to the siRNA that facilitate its uptake,
and the like.

Having described the invention with a degree of particularity, examples will
now be provided. These examples are not intended to and should not be construed to
limit the scope of the claims in any way.



Examples

General Techniques and Nomenclatures

siRNA nomenclature. All siRNA duplexes are referred to by sense strand. The first
nucleotide of the 5' end of the sense strand is position 1, which corresponds to
position 19 of the antisense strand for a 19-mer. In most cases, to compare results
from different experiments, silencing was determined by measuring specific transcript
mRNA levels or enzymatic activity associated with specific transcript levels, 24 hours
post-transfection, with siRNA concentrations held constant at 100 nM. For all
experiments, unless otherwise specified, transfection efficiency was ensured to be over
95%, and no detectable cellular toxicity was observed. The following system of
nomenclature was used to compare and report siRNA-silencing functionality: "F"
followed by the degree of minimal knockdown. For example, F50 signifies at least
50% knockdown, F80 means at least 80%, and so forth. For this study, all sub-F50
siRNAs were considered non-functional.
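
The nomenclature above translates directly into two small helpers. This is a sketch with hypothetical function names; the set of levels (50, 80, 95) is illustrative, chosen from thresholds that appear in this text, and other levels can be passed in.

```python
"""Sketch: F-class labels and sense/antisense position mapping for a 19-mer."""

from typing import Sequence


def f_label(knockdown_percent: float, levels: Sequence[int] = (50, 80, 95)) -> str:
    """Highest F-level whose minimum knockdown is met, else 'non-functional'."""
    met = [level for level in levels if knockdown_percent >= level]
    return f"F{max(met)}" if met else "non-functional"


def antisense_position(sense_position: int, length: int = 19) -> int:
    """Antisense position paired with a 1-based sense position in a duplex."""
    if not 1 <= sense_position <= length:
        raise ValueError("position outside the duplex")
    return length + 1 - sense_position


if __name__ == "__main__":
    print(f_label(83.0), f_label(42.0))
    print(antisense_position(1), antisense_position(10))
```

With a 19-mer, sense position 1 pairs with antisense position 19, as stated, and sense position 10 pairs with antisense position 10.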



Cell culture and transfection. 96-well plates are coated with 50 µl of 50 mg/ml
poly-L-lysine (Sigma) for 1 hr, and then washed 3X with distilled water before being dried
for 20 min. HEK293 cells or HEK293Lucs or any other cell type of interest are
released from their solid support by trypsinization, diluted to 3.5 × 10^5 cells/ml,
followed by the addition of 100 µL of cells/well. Plates are then incubated overnight
at 37°C, 5% CO2. Transfection procedures can vary widely depending on the cell
type and transfection reagents. In one non-limiting example, a transfection mixture
consisting of 2 mL Opti-MEM I (Gibco-BRL), 80 µl Lipofectamine 2000
(Invitrogen), 15 µL SUPERNasin at 20 U/µl (Ambion), and 1.5 µl of reporter gene
plasmid at 1 µg/µl is prepared in 5-ml polystyrene round bottom tubes. 100 µl of
transfection reagent is then combined with 100 µl of siRNAs in polystyrene deep-well
titer plates (Beckman) and incubated for 20 to 30 min at room temp. 550 µl of
Opti-MEM is then added to each well to bring the final siRNA concentration to 100 nM.
Plates are then sealed with parafilm and mixed. Media is removed from HEK293
cells and replaced with 95 µl of transfection mixture. Cells are incubated overnight at
37°C, 5% CO2.
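
The volumes in this protocol imply some simple dilution arithmetic, sketched below. The calculation assumes that volumes are additive and that the 100 µl siRNA aliquot is the only source of siRNA in the 750 µl mixture; the concentration of that aliquot is not stated in the text and is derived here only as an illustration. Function names are hypothetical.

```python
"""Sketch: dilution arithmetic implied by the transfection protocol."""


def required_stock_nm(final_nm: float, stock_ul: float, total_ul: float) -> float:
    """Stock concentration needed so that stock_ul diluted to total_ul gives final_nm."""
    return final_nm * total_ul / stock_ul      # C1 = C2 * V2 / V1


def cells_per_well(cells_per_ml: float, volume_ul: float) -> float:
    """Number of cells delivered in the given volume."""
    return cells_per_ml * volume_ul / 1000.0


if __name__ == "__main__":
    total_ul = 100 + 100 + 550                 # reagent + siRNA + Opti-MEM
    print("total volume (µl):", total_ul)
    print("siRNA aliquot (nM):", required_stock_nm(100, 100, total_ul))
    print("cells per well:", cells_per_well(3.5e5, 100))
```

Under these assumptions the 100 µl siRNA aliquot would need to be at 750 nM to reach 100 nM in the final mixture, and each well receives 35,000 cells.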

Quantification of gene knockdown. A variety of quantification procedures can be
used to measure the level of silencing induced by siRNA or siRNA pools. In one
non-limiting example: to measure mRNA levels 24 hrs post-transfection, QuantiGene
branched-DNA (bDNA) kits (Bayer) (Wang, et al, Regulation of insulin preRNA
splicing by glucose, Proc Natl Acad Sci 1997, 94:4360.) are used according to
manufacturer instructions. To measure luciferase activity, media is removed from
HEK293 cells 24 hrs post-transfection, and 50 µl of Steady-GLO reagent (Promega)
is added. After 5 min, plates are analyzed on a plate reader.

Example I. Sequences Used to Develop the Algorithm. 

Anti-Firefly and anti-Cyclophilin siRNA panels (Figure 5a, b) were sorted
according to the values predicted using Formula VIII. All siRNAs scoring more than 0
(Formula VIII) and more than 20 (Formula IX) are fully functional. All ninety
sequences for each gene (and DBI) appear below in Table III.
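
Judging from the first cyclophilin entries of Table III, consecutive 19-mers appear to be offset by two nucleotides along the target. A walk of that kind can be sketched as follows; the step size is an observation from the listed sequences rather than a stated design rule, the function name is hypothetical, and the target fragment is reassembled from the first three entries purely for illustration.

```python
"""Sketch: generate 19-mer sense strands by walking along a target in fixed steps."""

from typing import List


def tile_sense_strands(target: str, length: int = 19, step: int = 2) -> List[str]:
    """Return every `length`-mer of the target starting at multiples of `step`."""
    return [target[i:i + length]
            for i in range(0, len(target) - length + 1, step)]


if __name__ == "__main__":
    fragment = "GUUCCAAAAACAGUGGAUAAUUU"   # reassembled from Table III entries 1 to 3
    for number, sense in enumerate(tile_sense_strands(fragment), start=1):
        print(number, sense)
```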



TABLE III 



Cyclo 


1 


SEQ. ID 0032 


GUUCCAAAAACAGUGGAUA 


Cyclo 


2 


SEQ. ID 0033 


UCCAAAAACAGUGGAUAAU 


Cyclo 


3 


SEQ. ID 0034 


CAAAAACAGUGGAUAAUUU 


Cyclo 


4 


SEQ. ID 0035 


AAAACAGUGGAUAAUUUUG 


Cyclo 


5 


SEQ. ID 0036 


AACAGUGGAUAAULTUUGUG 


Cyclo 


6 


SEQ. ID 0037 


CAGUGGAUAAUUUUGUGGC 


Cyclo 


7 


SEQ. ID 0038 


GUGGAUAAUUUUGUGGCCU 


Cyclo 


8 


SEQ. ID 0039 


GGAUAAUUUUGUGGCCUUA 


Cyclo 


9 


SEQ. ID 0040 


AUAAUUUUGUGGCCUUAGC 


Cyclo 


10 


SEQ. ID 0041 


AAUUUUGUGGCCUUAGCUA 


Cyclo 


11 


SEQ. ID 0042 


UUUUGUGGCCUUAGCUACA 


Cyclo 


12 


SEQ. ID 0043 


UUGUGGCCUUAGCUACAGG 


Cyclo 


13 


SEQ. ID 0044 


GUGGCCUUAGCUACAGGAG 


Cyclo 


14 


SEQ. ID 0045 


GGCCUUAGCUACAGGAGAG 


Cyclo 


15 


SEQ. ID 0046 


CCUUAGCUACAGGAGAGAA 


Cyclo 


16 


SEQ. ID 0047 


UUAGCUACAGGAGAGAAAG 


Cyclo 


17 


SEQ. ID 0048 


AGCUACAGGAGAGAAAGGA 


Cyclo 


18 


SEQ. ID 0049 


CUAC AGGAGAGAAAGGAUU 


Cyclo 


19 


SEQ. ID 0050 


ACAGGAGAGAAAGGAUUUG 


Cyclo 


20 


SEQ. ID 0051 


AGGAGAGAAAGGAUUUGGC 


Cyclo 


21 


SEQ. ID 0052 


GAGAGAAAGGAUUUGGCUA 


Cyclo 


22 


SEQ. ID 0053 


GAGAAAGGAUUUGGCUACA 


Cyclo 


23 


SEQ. ID 0054 


GAAAGGAUUUGGCUACAAA 


Cyclo 


24 


SEQ. ID 0055 


AAGGAUUUGGCUACAAAAA 


Cyclo 


25 


SEQ. ID 0056 


GGAUUUGGCUACAAAAACA 


Cyclo 


26 


SEQ. ID 0057 


AUUUGGCUACAAAAACAGC 


Cyclo 


27 


SEQ. ID 0058 


UUGGCUACAAAAACAGCAA 


Cyclo 


28 


SEQ. ID 0059 


GGCUACAAAAACAGCAAAU 


Cyclo 


29 


SEQ. ID 0060 


CUACAAAAACAGCAAAUUC 


Cyclo 


30 


SEQ. ID 0061 


ACAAAAACAGCAAAUUCCA 


Cyclo 


31 


SEQ. ID 0062 


AAAAACAGCAAAUUCCAUC 


Cyclo 


32 


SEQ. ID 0063 


AAACAGCAAAUUCCAUCGU 


Cyclo 


33 


SEQ. ID 0064 


ACAGCAAAUUCCAUCGUGU 


Cyclo 


34 


SEQ. ID 0065 


AGCAAAUUCCAUCGUGUAA 
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Cyclo 


35 


SEQ. ID 0066 


CAAAUUCCAUCGUGUAAUC 


Cyclo 


36 


SEQ. ID 0067 


AAUUCCAUCGUGUAAUCAA 


Cyclo 


37 


SEQ. ID 0068 


UUCCAUCGUGUAAUCAAGG 


Cyclo 


38 


SEQ. ID 0069 


CCAUCGUGUAAUCAAGGAC 


Cyclo 


39 


SEQ. ID 0070 


AUCGUGUAAUCAAGGACUU 


Cyclo 


40 


SEQ. ID 0071 


CGUGUAAUCAAGGACUUCA 


Cyclo 


41 


SEQ. ID 0072 


UGUAAUCAAGGACUUCAUG 


Cyclo 


42 


SEQ. ID 0073 


UAAUCAAGGACUUCAUGAU 


Cyclo 


43 


SEQ. ID 0074 


AUCAAGGACUUCAUGAUCC 


Cyclo 


44 


SEQ. ID 0075 


CAAGGACUUCAUGAUCCAG 


Cyclo 


45 


SEQ. ID 0076 


AGGACUUCAUGAUCCAGGG 


Cyclo 


46 


SEQ. ID 0077 


GACUUCAUGAUCCAGGGCG 


Cyclo 


47 


SEQ. ID 0078 


CUUCAUGAUCCAGGGCGGA 


Cyclo 


48 


SEQ. ID 0079 


UCAUGAUCCAGGGCGGAGA 


Cyclo 


49 


SEQ. ID 0080 


AUGAUCCAGGGCGGAGACU 


Cyclo 


50 


SEQ. ID 0081 


GAUCCAGGGCGGAGACUUC 


Cyclo 


51 


SEQ. ID 0082 


UCCAGGGCGGAGACUUCAC 


Cyclo 


52 


SEQ. ID 0083 


CAGGGCGGAGACUUCACCA 


Cyclo 


53 


SEQ. ID 0084 


GGGCGGAGACUUCACCAGG 


Cyclo 


54 


SEQ. ID 0085 


GCGGAGACUUCACCAGGGG 


Cyclo 


55 


SEQ. ID 0086 


GGAGACUUCACCAGGGGAG 


Cyclo 


56 


SEQ. ID 0087 


AGACUUCACCAGGGGAGAU 


Cyclo 


57 


SEQ. ID 0088 


ACUUCACCAGGGGAGAUGG


Cyclo 


58 


SEQ. ID 0089 


UUCACCAGGGGAGAUGGCA 


Cyclo 


59 


SEQ. ID 0090 


CACCAGGGGAGAUGGCACA 


Cyclo 


60 


SEQ. ID 0091 


CCAGGGGAGAUGGCACAGG 


Cyclo 


61 


SEQ. ID 0092 


AGGGGAGAUGGCACAGGAG 


Cyclo 


62 


SEQ. ID 0093 


GGGAGAUGGCACAGGAGGA 


Cyclo 


63 


SEQ. ID 0094 


GAGAUGGCACAGGAGGAAA 


Cyclo 


64 


SEQ. ID 0095 


GAUGGCACAGGAGGAAAGA 


Cyclo 


65 


SEQ. ID 0094 


UGGCACAGGAGGAAAGAGC 


Cyclo 


66 


SEQ. ID 0096 


GCACAGGAGGAAAGAGCAU 


Cyclo 


67 


SEQ. ID 0097 


ACAGGAGGAAAGAGCAUCU 


Cyclo 


68 


SEQ. ID 0098 


AGGAGGAAAGAGCAUCUAC 


Cyclo 


69 


SEQ. ID 0099 


GAGGAAAGAGCAUCUACGG


Cyclo 


70 


SEQ. ID 0100 


GGAAAGAGCAUCUACGGUG 


Cyclo 


71 


SEQ. ID 0101 


AAAGAGCAUCUACGGUGAG 


Cyclo 


72 


SEQ. ID 0102 


AGAGCAUCUACGGUGAGCG 


Cyclo 


73 


SEQ. ID 0103 


AGCAUCUACGGUGAGCGCU


Cyclo


74 


SEQ. ID 0104 


CAUCUACGGUGAGCGCUUC 


Cyclo 


75 


SEQ. ID 0105 


UCUACGGUGAGCGCUUCCC 


Cyclo 


76 


SEQ. ID 0106 


UACGGUGAGCGCUUCCCCG 


Cyclo 


77 


SEQ. ID 0107 


CGGUGAGCGCUUCCCCGAU 


Cyclo 


78 


SEQ. ID 0108 


GUGAGCGCUUCCCCGAUGA 


Cyclo 


79 


SEQ. ID 0109 


GAGCGCUUCCCCGAUGAGA 


Cyclo 


80 


SEQ. ID 0110 


GCGCUUCCCCGAUGAGAAC 


Cyclo 


81 


SEQ. ID 0111 


GCUUCCCCGAUGAGAACUU 


Cyclo 


82 


SEQ. ID 0112 


UUCCCCGAUGAGAACUUCA 


Cyclo 


83 


SEQ. ID 0113 


CCCCGAUGAGAACUUCAAA 


Cyclo 


84 


SEQ. ID 0114 


CCGAUGAGAACUUCAAACU 


Cyclo 


85 


SEQ. ID 0115 


GAUGAGAACUUCAAACUGA 


Cyclo 


86 


SEQ. ID 0116 


UGAGAACUUCAAACUGAAG 


Cyclo 


87 


SEQ. ID 0117 


AGAACUUCAAACUGAAGCA 


Cyclo 


88 


SEQ. ID 0118 


AACUUCAAACUGAAGCACU 


Cyclo 


89 


SEQ. ID 0119 


CUUCAAACUGAAGCACUAC 


Cyclo 


90 


SEQ. ID 0120 


UCAAACUGAAGCACUACGG 


DB 


1 


SEQ. ID 0121 


ACGGGCAAGGCCAAGUGGG 


DB 


2 


SEQ. ID 0122 


CGGGCAAGGCCAAGUGGGA 


DB 


3 


SEQ. ID 0123 


GGGCAAGGCCAAGUGGGAU 


DB 


4 


SEQ. ID 0124 


GGCAAGGCCAAGUGGGAUG 


DB 


5 


SEQ. ID 0125 


GCAAGGCCAAGUGGGAUGC 


DB 


6 


SEQ. ID 0126 


CAAGGCCAAGUGGGAUGCC 


DB 


7 


SEQ. ID 0127 


AAGGCCAAGUGGGAUGCCU 


DB 


8 


SEQ. ID 0128 


AGGCCAAGUGGGAUGCCUG 


DB 


9


SEQ. ID 0129 


GGCCAAGUGGGAUGCCUGG 


DB 


10 


SEQ. ID 0130 


GCCAAGUGGGAUGCCUGGA 


DB 


11 


SEQ. ID 0131 


CCAAGUGGGAUGCCUGGAA 


DB 


12 


SEQ. ID 0132 


CAAGUGGGAUGCCUGGAAU 


DB 


13 


SEQ. ID 0133 


AAGUGGGAUGCCUGGAAUG 


DB 


14 


SEQ. ID 0134 


AGUGGGAUGCCUGGAAUGA 


DB 


15 


SEQ. ID 0135 


GUGGGAUGCCUGGAAUGAG 


DB 


16 


SEQ. ID 0136 


UGGGAUGCCUGGAAUGAGC 


DB 


17 


SEQ. ID 0137 


GGGAUGCCUGGAAUGAGCU 


DB 


18 


SEQ. ID 0138 


GGAUGCCUGGAAUGAGCUG 


DB 


19 


SEQ. ID 0139 


GAUGCCUGGAAUGAGCUGA 


DB 


20 


SEQ. ID 0140 


AUGCCUGGAAUGAGCUGAA 


DB 


21 


SEQ. ID 0141 


UGCCUGGAAUGAGCUGAAA 


DB 


22 


SEQ. ID 0142 


GCCUGGAAUGAGCUGAAAG 


DB 


23 


SEQ. ID 0143 


CCUGGAAUGAGCUGAAAGG


DB


24 


SEQ. ID 0144 


CUGGAAUGAGCUGAAAGGG 


DB 


25 


SEQ. ID 0145 


UGGAAUGAGCUGAAAGGGA 


DB 


26 


SEQ. ID 0146 


GGAAUGAGCUGAAAGGGAC 


DB 


27 


SEQ. ID 0147 


GAAUGAGCUGAAAGGGACU 


DB 


28 


SEQ. ID 0148 


AAUGAGCUGAAAGGGACUU 


DB 


29 


SEQ. ID 0149 


AUGAGCUGAAAGGGACUUC 


DB 


30 


SEQ. ID 0150 


UGAGCUGAAAGGGACUUCC 


DB 


31 


SEQ. ID 0151 


GAGCUGAAAGGGACUUCCA 


DB 


32 


SEQ. ID 0152 


AGCUGAAAGGGACUUCCAA 


DB 


33 


SEQ. ID 0153 


GCUGAAAGGGACUUCCAAG 


DB 


34 


SEQ. ID 0154 


CUGAAAGGGACUUCCAAGG 


DB 


35 


SEQ. ID 0155 


UGAAAGGGACUUCCAAGGA 


DB 


36 


SEQ. ID 0156 


GAAAGGGACUUCCAAGGAA 


DB 


37 


SEQ. ID 0157 


AAAGGGACUUCCAAGGAAG 


DB 


38 


SEQ. ID 0158 


AAGGGACUUCCAAGGAAGA 


DB 


39 


SEQ. ID 0159 


AGGGACUUCCAAGGAAGAU 


DB 


40 


SEQ. ID 0160 


GGGACUUCCAAGGAAGAUG 


DB 


41 


SEQ. ID 0161 


GGACUUCCAAGGAAGAUGC 


DB 


42 


SEQ. ID 0162 


GACUUCCAAGGAAGAUGCC 


DB 


43 


SEQ. ID 0163 


ACUUCCAAGGAAGAUGCCA 


DB 


44 


SEQ. ID 0164 


CUUCCAAGGAAGAUGCCAU 


DB 


45 


SEQ. ID 0165 


UUCCAAGGAAGAUGCCAUG 


DB 


46 


SEQ. ID 0166 


UCCAAGGAAGAUGCCAUGA 


DB 


47 


SEQ. ID 0167 


CCAAGGAAGAUGCCAUGAA


DB 


48 


SEQ. ID 0168 


CAAGGAAGAUGCCAUGAAA 


DB 


49 


SEQ. ID 0169 


AAGGAAGAUGCCAUGAAAG 


DB 


50 


SEQ. ID 0170 


AGGAAGAUGCCAUGAAAGC 


DB 


51 


SEQ. ID 0171 


GGAAGAUGCCAUGAAAGCU 


DB 


52 


SEQ. ID 0172 


GAAGAUGCCAUGAAAGCUU 


DB 


53 


SEQ. ID 0173 


AAGAUGCCAUGAAAGCUUA 


DB 


54 


SEQ. ID 0174 


AGAUGCCAUGAAAGCUUAC 


DB 


55 


SEQ. ID 0175 


GAUGCCAUGAAAGCUUACA 


DB 


56 


SEQ. ID 0176 


AUGCCAUGAAAGCUUACAU 


DB 


57 


SEQ. ID 0177 


UGCCAUGAAAGCUUACAUC 


DB 


58 


SEQ. ID 0178 


GCCAUGAAAGCUUACAUCA 


DB 


59 


SEQ. ID 0179 


CCAUGAAAGCUUACAUCAA 


DB 


60 


SEQ. ID 0180 


CAUGAAAGCUUACAUCAAC 


DB 


61 


SEQ. ID 0181 


AUGAAAGCUUACAUCAACA 


DB 


62 


SEQ. ID 0182 


UGAAAGCUUACAUCAACAA 


DB 


63 


SEQ. ID 0183 


GAAAGCUUACAUCAACAAA


DB


64 


SEQ. ID 0184 


AAAGCUUACAUCAACAAAG 


DB 


65 


SEQ. ID 0185 


AAGCUUACAUCAACAAAGU 


DB 


66 


SEQ. ID 0186 


AGCUUACAUCAACAAAGUA 


DB 


67 


SEQ. ID 0187 


GCUUACAUCAACAAAGUAG 


DB 


68 


SEQ. ID 0188 


CUUACAUCAACAAAGUAGA 


DB 


69 


SEQ. ID 0189 


UUACAUCAACAAAGUAGAA 


DB 


70 


SEQ. ID 0190 


UACAUCAACAAAGUAGAAG 


DB 


71 


SEQ. ID 0191 


ACAUCAACAAAGUAGAAGA 


DB 


72 


SEQ. ID 0192 


CAUCAACAAAGUAGAAGAG 


DB 


73 


SEQ. ID 0193 


AUCAACAAAGUAGAAGAGC 


DB 


74 


SEQ. ID 0194 


UCAACAAAGUAGAAGAGCU 


DB 


75 


SEQ. ID 0195 


CAACAAAGUAGAAGAGCUA 


DB 


76 


SEQ. ID 0196 


AACAAAGUAGAAGAGCUAA 


DB 


77 


SEQ. ID 0197 


ACAAAGUAGAAGAGCUAAA 


DB 


78 


SEQ. ID 0198 


CAAAGUAGAAGAGCUAAAG 


DB 


79 


SEQ. ID 0199 


AAAGUAGAAGAGCUAAAGA 


DB 


80 


SEQ. ID 0200 


AAGUAGAAGAGCUAAAGAA 


DB 


81 


SEQ. ID 0201 


AGUAGAAGAGCUAAAGAAA 


DB 


82 


SEQ. ID 0202 


GUAGAAGAGCUAAAGAAAA 


DB 


83 


SEQ. ID 0203 


UAGAAGAGCUAAAGAAAAA 


DB 


84 


SEQ. ID 0204 


AGAAGAGCUAAAGAAAAAA 


DB 


85 


SEQ. ID 0205 


GAAGAGCUAAAGAAAAAAU 


DB 


86 


SEQ. ID 0206 


AAGAGCUAAAGAAAAAAUA 


DB 


87 


SEQ. ID 0207 


AGAGCUAAAGAAAAAAUAC 


DB 


88 


SEQ. ID 0208 


GAGCUAAAGAAAAAAUACG 


DB 


89 


SEQ. ID 0209 


AGCUAAAGAAAAAAUACGG 


DB 


90 


SEQ. ID 0210 


GCUAAAGAAAAAAUACGGG 


Luc 


1 


SEQ. ID 0211 


AUCCUCAUAAAGGCCAAGA 


Luc 


2 


SEQ. ID 0212 


AGAUCCUCAUAAAGGCCAA 


Luc 


3 


SEQ. ID 0213 


AGAGAUCCUCAUAAAGGCC 


Luc 


4 


SEQ. ID 0214 


AGAGAGAUCCUCAUAAAGG 


Luc 


5 


SEQ. ID 0215 


UCAGAGAGAUCCUCAUAAA 


Luc 


6 


SEQ. ID 0216 


AAUCAGAGAGAUCCUCAUA 


Luc 


7 


SEQ. ID 0217 


AAAAUCAGAGAGAUCCUCA 


Luc 


8 


SEQ. ID 0218 


GAAAAAUCAGAGAGAUCCU 


Luc 


9 


SEQ. ID 0219 


AAGAAAAAUCAGAGAGAUC 


Luc 


10 


SEQ. ID 0220 


GCAAGAAAAAUCAGAGAGA 


Luc 


11 


SEQ. ID 0221 


ACGCAAGAAAAAUCAGAGA 


Luc 


12 


SEQ. ID 0222 


CGACGCAAGAAAAAUCAGA 


Luc 


13 


SEQ. ID 0223 


CUCGACGCAAGAAAAAUCA


Luc


14 


SEQ. ID 0224 


AACUCGACGCAAGAAAAAU 


Luc 


15 


SEQ. ID 0225 


AAAACUCGACGCAAGAAAA 


Luc 


16 


SEQ. ID 0226 


GGAAAACUCGACGCAAGAA 


Luc 


17 


SEQ. ID 0227 


CCGGAAAACUCGACGCAAG 


Luc 


18 


SEQ. ID 0228 


UACCGGAAAACUCGACGCA 


Luc 


19 


SEQ. ID 0229 


CUUACCGGAAAACUCGACG 


Luc 


20 


SEQ. ID 0230 


GUCUUACCGGAAAACUCGA 


Luc 


21 


SEQ. ID 0231 


AGGUCUUACCGGAAAACUC 


Luc 


22 


SEQ. ID 0232 


AAAGGUCUUACCGGAAAAC 


Luc 


23 


SEQ. ID 0233 


CGAAAGGUCUUACCGGAAA 


Luc 


24 


SEQ. ID 0234 


ACCGAAAGGUCUUACCGGA 


Luc 


25 


SEQ. ID 0235 


GUACCGAAAGGUCUUACCG 


Luc 


26 


SEQ. ID 0236 


AAGUACCGAAAGGUCUUAC 


Luc 


27 


SEQ. ID 0237 


CGAAGUACCGAAAGGUCUU 


Luc 


28 


SEQ. ID 0238 


GACGAAGUACCGAAAGGUC 


Luc 


29 


SEQ. ID 0239 


UGGACGAAGUACCGAAAGG 


Luc 


30 


SEQ. ID 0240 


UGUGGACGAAGUACCGAAA 


Luc 


31 


SEQ. ID 0241 


UUUGUGGACGAAGUACCGA 


Luc 


32 


SEQ. ID 0242 


UGUUUGUGGACGAAGUACC 


Luc 


33 


SEQ. ID 0243 


UGUGUUUGUGGACGAAGUA 


Luc 


34 


SEQ. ID 0244 


GUUGUGUUUGUGGACGAAG 


Luc 


35 


SEQ. ID 0245 


GAGUUGUGUUUGUGGACGA 


Luc 


36 


SEQ. ID 0246 


AGGAGUUGUGUUUGUGGAC 


Luc 


37 


SEQ. ID 0247 


GGAGGAGUUGUGUUUGUGG 


Luc 


38 


SEQ. ID 0248 


GCGGAGGAGUUGUGUUUGU 


Luc 


39 


SEQ. ID 0249 


GCGCGGAGGAGUUGUGUUU 


Luc 


40 


SEQ. ID 0250 


UUGCGCGGAGGAGUUGUGU 


Luc 


41 


SEQ. ID 0251 


AGUUGCGCGGAGGAGUUGU 


Luc 


42 


SEQ. ID 0252 


AAAGUUGCGCGGAGGAGUU 


Luc 


43 


SEQ. ID 0253 


AAAAAGUUGCGCGGAGGAG 


Luc 


44 


SEQ. ID 0254 


CGAAAAAGUUGCGCGGAGG 


Luc 


45 


SEQ. ID 0255 


CGCGAAAAAGUUGCGCGGA 


Luc 


46 


SEQ. ID 0256 


ACCGCGAAAAAGUUGCGCG 


Luc 


47 


SEQ. ID 0257 


CAACCGCGAAAAAGUUGCG 


Luc 


48 


SEQ. ID 0258 


AACAACCGCGAAAAAGUUG 


Luc 


49 


SEQ. ID 0259 


GUAACAACCGCGAAAAAGU 


Luc 


50 


SEQ. ID 0260 


AAGUAACAACCGCGAAAAA 


Luc 


51 


SEQ. ID 0261 


UCAAGUAACAACCGCGAAA 


Luc 


52 


SEQ. ID 0262 


AGUCAAGUAACAACCGCGA 


Luc 


53 


SEQ. ID 0263 


CCAGUCAAGUAACAACCGC


Luc


54 


SEQ. ID 0264 


CGCCAGUCAAGUAACAACC 


Luc 


55 


SEQ. ID 0265 


GUCGCCAGUCAAGUAACAA 


Luc 


56 


SEQ. ID 0266 


ACGUCGCCAGUCAAGUAAC 


Luc 


57 


SEQ. ID 0267 


UUACGUCGCCAGUCAAGUA 


Luc 


58 


SEQ. ID 0268 


GAUUACGUCGCCAGUCAAG 


Luc 


59 


SEQ. ID 0269 


UGGAUUACGUCGCCAGUCA 


Luc 


60 


SEQ. ID 0270 


CGUGGAUUACGUCGCCAGU 


Luc 


61 


SEQ. ID 0271 


AUCGUGGAUUACGUCGCCA 


Luc 


62 


SEQ. ID 0272 


AGAUCGUGGAUUACGUCGC 


Luc 


63 


SEQ. ID 0273 


AGAGAUCGUGGAUUACGUC 


Luc 


64 


SEQ. ID 0274 


AAAGAGAUCGUGGAUUACG 


Luc 


65 


SEQ. ID 0275 


AAAAAGAGAUCGUGGAUUA 


Luc 


66 


SEQ. ID 0276 


GGAAAAAGAGAUCGUGGAU 


Luc 


67 


SEQ. ID 0277 


ACGGAAAAAGAGAUCGUGG 


Luc 


68 


SEQ. ID 0278 


UGACGGAAAAAGAGAUCGU 


Luc 


69 


SEQ. ID 0279 


GAUGACGGAAAAAGAGAUC 


Luc 


70 


SEQ. ID 0280 


ACGAUGACGGAAAAAGAGA 


Luc 


71 


SEQ. ID 0281 


AGACGAUGACGGAAAAAGA 


Luc 


72 


SEQ. ID 0282 


AAAGACGAUGACGGAAAAA 


Luc 


73 


SEQ. ID 0283 


GGAAAGACGAUGACGGAAA 


Luc 


74 


SEQ. ID 0284 


ACGGAAAGACGAUGACGGA 


Luc 


75 


SEQ. ID 0285 


GCACGGAAAGACGAUGACG 


Luc 


76 


SEQ. ID 0286 


GAGCACGGAAAGACGAUGA 


Luc 


77 


SEQ. ID 0287 


UGGAGCACGGAAAGACGAU 


Luc 


78 


SEQ. ID 0288 


UUUGGAGCACGGAAAGACG 


Luc 


79 


SEQ. ID 0289 


GUUUUGGAGCACGGAAAGA 


Luc 


80 


SEQ. ID 0290 


UUGUUUUGGAGCACGGAAA 


Luc 


81 


SEQ. ID 0291 


UGUUGUUUUGGAGCACGGA 


Luc 


82 


SEQ. ID 0292 


GUUGUUGUUUUGGAGCACG 


Luc 


83 


SEQ. ID 0293 


CCGUUGUUGUUUUGGAGCA 


Luc 


84 


SEQ. ID 0294 


CGCCGUUGUUGUUUUGGAG 


Luc 


85 


SEQ. ID 0295 


GCCGCCGUUGUUGUUUUGG 


Luc 


86 


SEQ. ID 0296 


CCGCCGCCGUUGUUGUUUU 


Luc 


87 


SEQ. ID 0297 


UCCCGCCGCCGUUGUUGUU 


Luc 


88 


SEQ. ID 0298 


CUUCCCGCCGCCGUUGUUG 


Luc 


89 


SEQ. ID 0299 


AACUUCCCGCCGCCGUUGU 


Luc 


90 


SEQ. ID 0300 


UGAACUUCCCGCCGCCGUU

Example II. Validation of the Algorithm using DBI, Luciferase, PLK, EGFR,
and SEAP 

The algorithm (Formula VIII) identified siRNAs for five genes: human DBI,
firefly luciferase (fLuc), renilla luciferase (rLuc), human PLK, and human
secreted alkaline phosphatase (SEAP). Four individual siRNAs were selected on
the basis of their SMARTscores™ derived by analysis of their sequence using
Formula VIII (all of the siRNAs would be selected with Formula IX as well) and
analyzed for their ability to silence their targets' expression. In addition to
the scoring, a BLAST search was conducted for each siRNA. To minimize the
potential for off-target silencing effects, only those target sequences with
more than three mismatches against unrelated sequences were selected.
Semizarov, et al., Specificity of short interfering RNA determined through gene
expression signatures, Proc. Natl. Acad. Sci. U.S.A. 2003, 100:6347. These
duplexes were analyzed individually and in pools of 4 and compared with several
siRNAs that were randomly selected. The functionality was measured as a
percentage of targeted gene knockdown as compared to controls. All siRNAs were
transfected as described by the methods above at 100 nM concentration into
HEK293 using Lipofectamine 2000. The level of the targeted gene expression was
evaluated by B-DNA as described above and normalized to the non-specific
control. Figure 10 shows that the siRNAs selected by the algorithm disclosed
herein were significantly more potent than randomly selected siRNAs. The
algorithm increased the chances of identifying an F50 siRNA from 48% to 91%,
and an F80 siRNA from 13% to 57%. In addition, pools of SMART siRNA silence the
selected target better than randomly selected pools (see Figure 10F).
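As a rough illustration of how hit rates such as the F50 and F80 figures quoted
above can be tallied, the following Python sketch counts the fraction of siRNAs
whose measured knockdown meets a given threshold. It assumes that F50 and F80
denote at least 50% and at least 80% silencing, respectively. The function name
and the knockdown values are hypothetical placeholders and are not data from
this study.

```python
def fraction_meeting(knockdowns, threshold):
    """Return the fraction of percent-knockdown values at or above threshold."""
    if not knockdowns:
        raise ValueError("no knockdown values supplied")
    hits = sum(1 for k in knockdowns if k >= threshold)
    return hits / len(knockdowns)


# Hypothetical percent-knockdown values for eight siRNAs (illustration only).
example = [92.0, 85.5, 40.0, 78.0, 55.0, 81.0, 30.0, 95.0]

f50 = fraction_meeting(example, 50.0)  # assumed meaning: >= 50% silencing
f80 = fraction_meeting(example, 80.0)  # assumed meaning: >= 80% silencing
print(f"F50: {f50:.0%}, F80: {f80:.0%}")
```

Applying the same tally to a randomly selected set and to an
algorithm-selected set would give the kind of side-by-side comparison reported
for Figure 10.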

Example III. Validation of the Algorithm Using Genes Involved in
Clathrin-Dependent Endocytosis.

Components of the clathrin-mediated endocytosis pathway are key to modulating
intracellular signaling and play important roles in disease. Chromosomal
rearrangements that result in fusion transcripts between the Mixed-Lineage
Leukemia gene (MLL) and CALM (Clathrin assembly lymphoid myeloid leukemia gene)
are believed to play a role in leukemogenesis. Similarly, disruptions in Rab7
and Rab9, as well as HIP1 (Huntingtin-interacting protein), genes that are
believed to be involved in endocytosis, are potentially responsible for lipid
storage diseases and neuronal diseases, respectively. For these reasons, siRNA
directed against clathrin and other genes involved in the clathrin-mediated
endocytotic pathway are potentially important research and therapeutic tools.

siRNAs directed against genes involved in the clathrin-mediated endocytosis
pathways were selected using Formula VIII. The targeted genes were clathrin
heavy chain (CHC, accession # NM_004859), clathrin light chain A (CLCa,
NM_001833), clathrin light chain B (CLCb, NM_001834), CALM (U45976), β2 subunit
of AP-2 (β2, NM_001282), Eps15 (NM_001981), Eps15R (NM_021235), dynamin II
(DYNII, NM_004945), Rab5a (BC001267), Rab5b (NM_002868), Rab5c (AF141304), and
EEA.1 (XM_018197).



For each gene, the four siRNA duplexes with the highest scores were selected
and a BLAST search was conducted for each of them using the Human EST database.
In order to minimize the potential for off-target silencing effects, only those
sequences with more than three mismatches against unrelated sequences were
used. All duplexes were synthesized at Dharmacon, Inc. as 21-mers with 3'-UU
overhangs using a modified method of 2'-ACE chemistry (Scaringe, Advanced
5'-silyl-2'-orthoester approach to RNA oligonucleotide synthesis, Methods
Enzymol. 2000, 317:3), and the antisense strand was chemically phosphorylated
to ensure maximized activity.
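The specificity criterion above (more than three mismatches against unrelated
sequences) can be sketched in Python as follows. This is a simplified
illustration only: it compares a candidate against every same-length window of
each unrelated transcript without gaps, whereas a BLAST search performs local
alignment. The function names and the `min_required` parameter are hypothetical
and are not taken from this disclosure.

```python
def min_mismatches(candidate, transcript):
    """Smallest ungapped mismatch count between candidate and any
    same-length window of transcript."""
    n = len(candidate)
    if len(transcript) < n:
        return n  # no full-length window exists; treat as fully mismatched
    best = n
    for i in range(len(transcript) - n + 1):
        window = transcript[i:i + n]
        mismatches = sum(1 for a, b in zip(candidate, window) if a != b)
        best = min(best, mismatches)
    return best


def passes_specificity(candidate, unrelated_transcripts, min_required=4):
    """Keep a candidate only if it has more than three mismatches
    (that is, at least min_required) against every unrelated transcript."""
    return all(
        min_mismatches(candidate, t) >= min_required
        for t in unrelated_transcripts
    )
```

A candidate would be retained only when `passes_specificity` returns `True` for
the full set of unrelated sequences under consideration.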


HeLa cells were grown in Dulbecco's modified Eagle's medium (DMEM) containing
10% fetal bovine serum, antibiotics and glutamine. siRNA duplexes were
resuspended in 1X siRNA Universal buffer (Dharmacon, Inc.) to 20 µM prior to
transfection. HeLa cells in 12-well plates were transfected twice with 4 µl of
20 µM siRNA duplex in 3 µl Lipofectamine 2000 reagent (Invitrogen, Carlsbad,
California, USA) at 24-hour intervals. For the transfections in which 2 or 3
siRNA duplexes were included, the amount of each duplex was decreased, so that
the total amount was the same as in transfections with single siRNAs. Cells
were plated into normal culture medium 12 hours prior to experiments, and
protein levels were measured 2 or 4 days after the first transfection.
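The arithmetic behind keeping the total siRNA amount constant when duplexes are
combined can be sketched as below. The sketch assumes that every duplex stock
is at 20 µM and that the reduction is an equal split among the duplexes; the
text states only that the total amount was unchanged, so the equal split and
the function names are illustrative assumptions.

```python
def duplex_amount_pmol(volume_ul, conc_um):
    """Amount of duplex in pmol: 1 µl of a 1 µM stock contains 1 pmol."""
    return volume_ul * conc_um


def per_duplex_volume_ul(total_volume_ul, n_duplexes):
    """Volume of each duplex stock when the total is split equally (assumed)."""
    if n_duplexes < 1:
        raise ValueError("need at least one duplex")
    return total_volume_ul / n_duplexes


total = duplex_amount_pmol(4.0, 20.0)  # single-duplex transfection
for n in (1, 2, 3):
    vol = per_duplex_volume_ul(4.0, n)
    each = duplex_amount_pmol(vol, 20.0)
    print(f"{n} duplex(es): {vol:.2f} µl each, {each:.1f} pmol each, "
          f"{total:.0f} pmol total")
```

Under these assumptions a single-duplex transfection delivers 80 pmol, and each
member of a two- or three-duplex mixture contributes one half or one third of
that amount.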

Equal amounts of lysates were resolved by electrophoresis, blotted, and stained
with the antibody specific to the targeted protein, as well as antibodies
specific to unrelated proteins, PP1 phosphatase and Tsg101 (not shown). The
cells were lysed in Triton X-100/glycerol solubilization buffer as described
previously. Tebar, Bohlander, & Sorkin, Clathrin Assembly Lymphoid Myeloid
Leukemia (CALM) Protein: Localization in Endocytic-coated Pits, Interactions
with Clathrin, and the Impact of Overexpression on Clathrin-mediated Traffic,
Mol. Biol. Cell Aug 1999, 10:2687. Cell lysates were electrophoresed,
transferred to nitrocellulose membranes, and Western blotting was performed
with several antibodies followed by detection using an enhanced
chemiluminescence system (Pierce, Inc). Several x-ray films were analyzed to
determine the linear range of the chemiluminescence signals, and the
quantifications were performed using densitometry and AlphaImager v5.5 software
(Alpha Innotech Corporation). In experiments with Eps15R-targeted siRNAs, cell
lysates were subjected to immunoprecipitation with Ab860, and Eps15R was
detected in immunoprecipitates by Western blotting as described above.



The antibodies to assess the levels of each protein by Western blot were
obtained from the following sources: monoclonal antibody to clathrin heavy
chain (TD.1) was obtained from American Type Culture Collection (Rockville, MD,
USA); polyclonal antibody to dynamin II was obtained from Affinity Bioreagents,
Inc. (Golden, CO, USA); monoclonal antibodies to EEA.1 and Rab5a were purchased
from BD Transduction Laboratories (Los Angeles, CA, USA); the monoclonal
antibody to Tsg101 was purchased from Santa Cruz Biotechnology, Inc. (Santa
Cruz, CA, USA); the monoclonal antibody to GFP was from ZYMED Laboratories Inc.
(South San Francisco, CA, USA); the rabbit polyclonal antibodies Ab32 specific
to α-adaptins and Ab20 to CALM were described previously (Sorkin, et al.,
Stoichiometric Interaction of the Epidermal Growth Factor Receptor with the
Clathrin-associated Protein Complex AP-2, J. Biol. Chem. Jan 1995, 270:619);
the polyclonal antibodies to clathrin light chains A and B were kindly provided
by Dr. F. Brodsky (UCSF); monoclonal antibodies to PP1 (BD Transduction
Laboratories) and α-Actinin (Chemicon) were kindly provided by Dr. M.
Dell'Acqua (University of Colorado); Eps15 Ab577 and Eps15R Ab860 were kindly
provided by Dr. P.P. Di Fiore (European Cancer Institute).

Figure 11 demonstrates the in vivo functionality of 48 individual siRNAs,
selected using Formula VIII (most of them will meet the criteria incorporated
by Formula IX as well) targeting 12 genes. Various cell lines were transfected
with siRNA duplexes (Dup1-4) or pools of siRNA duplexes (Pool), and the cells
were lysed 3 days after transfection with the exception of CALM (2 days) and β2
(4 days).



Note a β1-adaptin band (part of AP-1 Golgi adaptor complex) that runs slightly
slower than β2 adaptin. CALM has two splice variants, 66 and 72 kD. The
full-length Eps15R (a doublet of ~130 kD) and several truncated spliced forms
of ~100 kD and ~70 kD were detected in Eps15R immunoprecipitates (shown by
arrows). The cells were lysed 3 days after transfection. Equal amounts of
lysates were resolved by electrophoresis and blotted with the antibody specific
to a targeted protein (GFP antibody for YFP fusion proteins) and the antibody
specific to unrelated proteins PP1 phosphatase or α-actinin, and TSG101. The
amount of protein in each specific band was normalized to the amount of
non-specific proteins in each lane of the gel. Nearly all of the siRNAs appear
to be functional, which establishes that Formula VIII and IX can be used to
predict siRNAs' functionality in general in a genome wide manner.
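The normalization described above can be illustrated with the following Python
sketch, which divides the densitometry signal of the targeted band by that of a
non-specific (loading control) band in the same lane and then expresses the
result relative to a control lane. The function names and the example signal
values are hypothetical; they are not measurements from Figure 11.

```python
def normalized_level(target_band, loading_band):
    """Target band signal divided by the non-specific band in the same lane."""
    if loading_band <= 0:
        raise ValueError("loading-control signal must be positive")
    return target_band / loading_band


def percent_remaining(sample_target, sample_loading,
                      control_target, control_loading):
    """Protein remaining in the siRNA lane as a percentage of the control lane."""
    sample = normalized_level(sample_target, sample_loading)
    control = normalized_level(control_target, control_loading)
    return 100.0 * sample / control


# Hypothetical densitometry readings (arbitrary units), illustration only.
remaining = percent_remaining(200.0, 1000.0, 1000.0, 1000.0)
print(f"protein remaining: {remaining:.1f}%, knockdown: {100.0 - remaining:.1f}%")
```

Normalizing within each lane first means that differences in the amount of
lysate loaded do not masquerade as silencing.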

To generate the fusion of yellow fluorescent protein (YFP) with Rab5b or Rab5c
(YFP-Rab5b or YFP-Rab5c), a DNA fragment encoding the full-length human Rab5b
or Rab5c was obtained by PCR using Pfu polymerase (Stratagene) with a SacI
restriction site introduced into the 5' end and a KpnI site into the 3' end and
cloned into pEYFP-C1 vector (CLONTECH, Palo Alto, CA, USA). GFP-CALM and
YFP-Rab5a were described previously (Tebar, Bohlander, & Sorkin, Clathrin
Assembly Lymphoid Myeloid Leukemia (CALM) Protein: Localization in
Endocytic-coated Pits, Interactions with Clathrin, and the Impact of
Overexpression on Clathrin-mediated Traffic, Mol. Biol. Cell Aug 1999,
10:2687).

Example IV. Validation of the Algorithm Using Eg5, GAPDH, ATE1, MEK2,
MEK1, QB, LaminA/C, c-myc, human cyclophilin, and mouse cyclophilin.

A number of genes have been identified as playing potentially important roles
in disease etiology. Expression profiles of normal and diseased kidneys have
implicated Edg5 in immunoglobulin A nephropathy, a common renal glomerular
disease. Mycl, MEK1/2 and other related kinases have been associated with one
or more cancers, while lamins have been implicated in muscular dystrophy and
other diseases. For these reasons, siRNA directed against the genes encoding
these classes of molecules would be important research and therapeutic tools.

Figure 12 illustrates four siRNAs targeting each of 10 different genes (see
Table V for sequence and accession number information) that were selected
according to Formula VIII and assayed as individuals and pools in HEK293 cells.
The level of siRNA induced silencing was measured using the B-DNA assay. These
studies demonstrated that thirty-six out of the forty individual SMART-selected
siRNAs tested are functional (90%) and all 10 pools are fully functional.

Example V. Validation of the Algorithm Using Bcl2 

Bcl-2 is a ~25kD, 205-239 amino acid, anti-apoptotic protein that contains
considerable homology with other members of the BCL family including BCLX,
MCL1, BAX, BAD, and BIK. The protein exists in at least two forms (Bcl2a, which
has a hydrophobic tail for membrane anchorage, and Bcl2b, which lacks the
hydrophobic tail) and is predominantly localized to the mitochondrial membrane.
While Bcl2 expression is widely distributed, particular interest has focused on
the expression of this molecule in B and T cells. Bcl2 expression is
down-regulated in normal germinal center B cells, yet in a high percentage of
follicular lymphomas, Bcl2 expression has been observed to be elevated.
Cytological studies have identified a common translocation ((14;18)(q32;q32))
amongst a high percentage (>70%) of these lymphomas. This genetic lesion places
the Bcl2 gene in juxtaposition to immunoglobulin heavy chain gene (IgH)
encoding sequences and is believed to enforce inappropriate levels of gene
expression, and resistance to programmed cell death in the follicle center B
cells. In other cases, hypomethylation of the Bcl2 promoter leads to enhanced
expression and again, inhibition of apoptosis. In addition to cancer,
dysregulated expression of Bcl-2 has been correlated with multiple sclerosis
and various neurological diseases.

The correlation between Bcl-2 translocation and cancer makes this gene an
attractive target for RNAi. Identification of siRNA directed against the bcl2
transcript (or Bcl2-IgH fusions) would further our understanding of Bcl2 gene
function and possibly provide a future therapeutic agent to battle diseases
that result from altered expression or function of this gene.



In Silico Identification of Functional siRNA 

To identify functional and hyperfunctional siRNA against the Bcl2 gene, the
sequence for Bcl-2 was downloaded from the NCBI Unigene database and analyzed
using the Formula VIII algorithm. As a result of these procedures, both the
sequence and SMARTscores™ of the Bcl2 siRNA were obtained and ranked according
to their functionality. Subsequently, these sequences were BLAST'ed (database)
to ensure that the selected sequences were specific and contained minimal
overlap with unrelated genes. The SMARTscores™ for the top 10 Bcl-2 siRNA are
identified in Figure 13.
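The enumerate, score, and rank procedure described above can be sketched in
Python as follows. Formula VIII is defined elsewhere in this disclosure and is
not reproduced here, so the scoring function is supplied by the caller;
`toy_score` is a hypothetical stand-in used only to make the example runnable
and is not the SMARTscore. The function names and the 1-based position
convention are likewise assumptions made for illustration.

```python
def candidate_windows(transcript, length=19):
    """Yield (1-based start position, subsequence) for every window."""
    for start in range(len(transcript) - length + 1):
        yield start + 1, transcript[start:start + length]


def rank_candidates(transcript, score_fn, top_n=10, length=19):
    """Score every window and return the top_n as (score, position, sequence),
    highest score first; ties are broken by earlier position."""
    scored = [
        (score_fn(seq), pos, seq)
        for pos, seq in candidate_windows(transcript, length)
    ]
    scored.sort(key=lambda item: (-item[0], item[1]))
    return scored[:top_n]


def toy_score(seq):
    """Placeholder score (negative G/C count). NOT Formula VIII."""
    return -sum(1 for base in seq if base in "GC")


# Hypothetical short sequence, scored with the placeholder function.
for score, pos, seq in rank_candidates("GGGGAAAACCCC", toy_score, top_n=3, length=4):
    print(pos, seq, score)
```

Passing the scoring function as a parameter keeps the ranking step independent
of whichever formula is used; the top-ranked candidates would then be screened
for specificity before synthesis.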


In Vivo Testing of Bcl-2 siRNA

Bcl-2 siRNAs having the top ten SMARTscores™ were selected and tested in a
functional assay to determine silencing efficiency. To accomplish this, each of
the ten duplexes was synthesized using 2'-O-ACE chemistry and transfected at
100 nM concentrations into cells. Twenty-four hours later assays were performed
on cell extracts to assess the degree of target silencing. Controls used in
these experiments included mock transfected cells, and cells that were
transfected with a non-specific siRNA duplex.

The results of these experiments are presented below (and in Figure 14) and
show that all ten of the selected siRNA induce 80% or better silencing of the
Bcl2 message at 100 nM concentrations. These data verify that the algorithm
successfully identified functional Bcl2 siRNA and provide a set of functional
agents that can be used in experimental and therapeutic environments.

Bcl2 siRNA: Sense Strand, 5'->3'

siRNA 1    GGGAGAUAGUGAUGAAGUA    SEQ. ID NO. 301
siRNA 2    GAAGUACAUCCAUUAUAAG    SEQ. ID NO. 302
siRNA 3    GUACGACAACCGGGAGAUA    SEQ. ID NO. 303
siRNA 4    AGAUAGUGAUGAAGUACAU    SEQ. ID NO. 304
siRNA 5    UGAAGACUCUGCUCAGUUU    SEQ. ID NO. 305
siRNA 6    GCAUGCGGCCUCUGUUUGA    SEQ. ID NO. 306
siRNA 7    UGCGGCCUCUGUUUGAUUU    SEQ. ID NO. 307
siRNA 8    GAGAUAGUGAUGAAGUACA    SEQ. ID NO. 308
siRNA 9    GGAGAUAGUGAUGAAGUAC    SEQ. ID NO. 309
siRNA 10   GAAGACUCUGCUCAGUUUG    SEQ. ID NO. 310
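Because only sense strands are listed, it may be convenient to derive the
complementary antisense strand computationally. The Python sketch below returns
the RNA reverse complement of a sense sequence, written 5'->3'. It considers
only the base-paired core of the listed sequence and ignores overhangs and
chemical modifications; the function name is a hypothetical illustration.

```python
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_strand(sense):
    """Return the RNA reverse complement (5'->3') of a sense strand."""
    sense = sense.upper()
    invalid = set(sense) - set("ACGU")
    if invalid:
        raise ValueError(f"unexpected characters: {sorted(invalid)}")
    return sense.translate(_COMPLEMENT)[::-1]


# siRNA 1 from the list above.
print(antisense_strand("GGGAGAUAGUGAUGAAGUA"))  # UACUUCAUCACUAUCUCCC
```

The same function also serves as a simple validity check, since it rejects any
entry containing characters other than A, C, G, and U.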



Example VI. Sequences Selected by the Algorithm 

Sequences of the siRNAs selected using Formulas (Algorithms) VIII and IX
with their corresponding ranking, which have been evaluated for the silencing
activity in vivo in the present study (Formula VIII and IX, respectively).



TABLE V 



Gene 


Accession 






Formula 


Formula 


Name 


Number 


SEQ. ID NO. 


FTllSeqTence 


VIII 


IX 


CLTC 


NM_004859 


SEQ. ID NO. 0301 


GAAAGAATCTGTAGAGAAA 


76 


94.2 


CLTC 


NM__004859 


SEQ. ID NO. 0302 


GCAATGAGCTGTTTGAAGA 


65 


39.9 


CLTC 


NMJ)04859 


SEQ. ID NO. 0303 


TGACAAAGGTGGATAAATT 


57 


38.2 


CLTC 


NM_004859 


SEQ. ID NO, 0304 


GGAAATGGATCTCTTTGAA 


54 


49.4 


CLTA 


NM_001833 


SEQ. ID NO. 0305 


GGAAAGTAATGGTCCAACA 


22 


55.5 


CLTA 


NMJ)01833 


SEQ. ID NO. 0306 


AGACAGTTATGCAGCTATT 


4 


22.9 


CLTA 


NMJ301833 


SEQ. ID NO. 0307 


CCAATTCTCGGAAGCAAGA 


1 


17 


CLTA 


NMJ)01833 


SEQ. ID NO. 0308 


GAAAGTAATGGTCCAACAG 


-1 


-13 


CLTB 


NMJ)01834 


SEQ. ID NO. 0309 


GCGCCAGAGTGAACAAGTA 


17 


57.5 


CLTB 


NM_001834 


SEQ. ID NO. 0310 


GAAGGTGGCCCAGCTATGT 


15 


-8.6 


CLTB 


NMJ)01834 


SEQ. ID NO. 0311 


GGAACCAGCGCCAGAGTGA 


13 


40.5 


CLTB 


NM_001834 


SEQ. ID NO. 0312 


GAGCGAGATTGCAGGCATA 


20 


61.7 


CALM 


U45976 


SEQ. ID NO. 0313 


GTTAGTATCTGATGACTTG 


36 


-34.6 


CALM 


U45976 


SEQ. ID NO. 0314 


GAAATGGAACCACTAAGAA 


33 


46.1 


CALM 


U45976 


SEQ. ID NO. 0315 


GGAAATGGAACCACTAAGA 


30 


61.2 


CALM 


U45976 


SEQ. ID NO. 0316 


CAACTACACTTTCCAATGC 


28 


6.8 


EPS 15 


NMJXH981 


SEQ. ID NO. 0317 


CCACCAAGATTTCATGATA 


48 


25.2 
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EPS15 


NM_001981 


SEQ. ID NO. 0318 


EPS 15 


NM_001981 


SEQ. ID NO. 0319 


EPS 15 


NMJD01981 


SEQ. ID NO. 0320 


EPS15R 


NMJ)21235 


SEQ. ID NO. 0321 


EPS15R 


NM 021235 


SEQ. ID NO. 0322 


EPS15R 


NM_021235 


SEQ. ID NO. 0323 


EPS15R 


NM_021235 


SEQ. ID NO. 0324 


DNM2 


NM_004945 


SEQ. ID NO. 0325 


DNM2 


NM_004945 


SEQ. ID NO. 0326 


DNM2 


NMJ304945 


SEQ. ID NO. 0327 


DNM2 


NM_004945 


SEQ. ID NO. 0328 


ARF6 


AF93885 


SEQ. ID NO. 0329 


ARF6 


AF93885 


SEQ. ID NO. 0330 


ARF6 


AF93885 


SEQ. .ID NO. 0331 


ARP6 


AF93885 


SEQ. ID NO. 0332 


RAB5A 


BC001267 


SEQ. ID NO. 0333 


RAB5A 


BC001267 


SEQ. ID NO. 0334 


RAB5A 


BC001267 


SEQ. ID NO. 0335 


RAB5A 


BC001267 


SEQ. ID NO. 0336 


RAB5B 


NM_002868 


SEQ. ID NO. 0337 


RAB5B 


NM_002868 


SEQ. ID NO. 0338 


RAB5B 


NM 002868 


SEQ. ID NO. 0339 


RAB5B 


NM_002868 


SEQ. ID NO. 0340 


RAB5C 


AF141304 


SEQ. ID NO. 0341 


RAB5C 


AP141304 


SEQ. ID NO. 0342 


RAB5C 


AF141304 


SEQ. ID NO. 0343 


RAB5C 


AF141304 


SEQ. ID NO. 0344 


EEA1 


XMJ318197 


SEQ. ID NO. 0345 


EEA1 


XM_018197 


SEQ. ID NO. 0346 


EEA1 


XM_018197 


SEQ. ID NO. 0347 


EEA1 


XM_018197 


SEQ. ID NO. 0348 


AP2B1 


NM_001282 


SEQ. ID NO. 0349 


AP2B1 


NM 001282 


SEQ. ID NO. 0350 


AP2B1 


NM_001282 


SEQ. ID NO. 0351 


AP2B1 


NM_001282 


SEQ. ID NO. 0352 


PLK 


NM_005030 


SEQ. ID NO. 0353 


PLK 


NM_005030 


SEQ. ID NO. 0354 


PLK 


NM_005030 


SEQ. ID NO. 0355 


PLK 


NM_005030 


SEQ. ID NO. 0356 


GAPDH 


NM_002046 


SEQ. ID NO. 0357 
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GATCGGAACTCCAACAAGA 


43 


49.3 


AAACGGAGCTACAGATTAT 


39 


11.5 


CCACACAGCATTCTTGTAA 


33 


-23.6 


GAAGTTACCTTGAGCAATC 


48 


33 


GGACTTGGCCGATCCAGAA 


27 


33 


GCACTTGGATCGAGATGAG 


20 


1.3 


CAAAGACCAATTCGCGTTA 


17 


27.7 


CCGAATCAATCGCATCTTC 


6 


-29.6 


GACATGATCCTGCAGTTCA 


5 


-14 


GAGCGAATCGTCACCACTT 


5 


24 


CCTCCGAGCTGGCGTCTAC 


-4 


-63.6 


TCACATGGTTAACCTCTAA 


27 


-21.1 


GATGAGGGACGCCATAATC 


7 


-38.4 


CCTCTAACTACAAATCTTA 


4 


16.9 


GGAAGGTGCTATCCAAAAT 


4 


11.5 


GCAAGCAAGTCCTAACATT 


40 


25.1 


GGAAGAGGAGTAGACCTTA 


17 


50.1 


AGGAATCAGTGTTGTAGTA 


16 


11.5 


GAAGAGGAGTAGACCTTAC 


12 


7 


GAAAGTCAAGCCTGGTATT 


14 


18.1 


AAAGTCAAGCCTGGTATTA 


6 


-17.8 


GCTATGAACGTGAATGATC 


3 


-21.1 


CAAGCCTGGTATTACGTTT 


-7 


-37.5 


GGAACAAGATCTGTCAATT 


38 


51.9, 


GCAATGAACGTGAACGAAA 


29 


43.7 


CAATGAACGTGAACGAAAT 


18 


43.3 


GGACAGGAGCGGTATCACA 


6 


18.2 


AGACAGAGCTTGAGAATAA 


67 


64.1 


GAGAAGATCTTTATGCAAA 


60 


48.7 


GAAGAGAAATCAGCAGATA 


58 


45.7 


GCAAGTAACTCAACTAACA 


56 


72.3 


GAGCTAATCTGCCACATTG 


49 


-12.4 


GCAGATGAGTTACTAGAAA 


44 


48.9 


CAACTJAATTGTCCAGAAA 


41 


28.2 


CAACACAGGATTCTGATAA 


33 


-5.8 


AGATTGTGCCTAAGTCTCT 


-35 


-3.4 


ATGAAGATCTGGAGGTGAA 


0 


-4.3 


TTTGAGACTTCTTGCCTAA 


-5 


-27.7 


AGATCACCCTCCTTAAATA 


15 


72.3 


CAACGGATTTGGTCGTATT 


27 


-2.8 
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GAPDH 


NMJ)02046 


SEQ. ID NO. 0358 


GAAATCCCATCACCATCTT 


24 


3.9 


GAPDH | NM_002046 | SEQ. ID NO. 0359 | GACCTCAACTACATGGTTT | 22 | -22.9
GAPDH | NM_002046 | SEQ. ID NO. 0360 | TGGTTTACATGTTCCAATA | 9 | 9.8
c-Myc | | SEQ. ID NO. 0361 | GAAGAAATCGATGTTGTTT | 31 | -11.7
c-Myc | | SEQ. ID NO. 0362 | ACACAAACTTGAACAGCTA | 22 | 51.3
c-Myc | | SEQ. ID NO. 0363 | GGAAGAAATCGATGTTGTT | 18 | 26
c-Myc | | SEQ. ID NO. 0364 | GAAACGACGAGAACAGTTG | 18 | -8.9
MAP2K1 | NM_002755 | SEQ. ID NO. 0365 | GCACATGGATGGAGGTTCT | 26 | 16
MAP2K1 | NM_002755 | SEQ. ID NO. 0366 | GCAGAGAGAGCAGATTTGA | 16 | 0.4
MAP2K1 | NM_002755 | SEQ. ID NO. 0367 | GAGGTTCTCTGGATCAAGT | 14 | 15.5
MAP2K1 | NM_002755 | SEQ. ID NO. 0368 | GAGCAGATTTGAAGCAACT | 14 | 18.5
MAP2K2 | NM_030662 | SEQ. ID NO. 0369 | CAAAGACGATGACTTCGAA | 37 | 26.4
MAP2K2 | NM_030662 | SEQ. ID NO. 0370 | GATCAGCATTTGCATGGAA | 24 | -0.7
MAP2K2 | NM_030662 | SEQ. ID NO. 0371 | TCCAGGAGTTTGTCAATAA | 17 | -4.5
MAP2K2 | NM_030662 | SEQ. ID NO. 0372 | GGAAGCTGATCCACCTTGA | 16 | 59.2
KNSL1(EG5) | NM_004523 | SEQ. ID NO. 0373 | GCAGAAATCTAAGGATATA | 53 | 35.8
KNSL1(EG5) | NM_004523 | SEQ. ID NO. 0374 | CAACAAGGATGAAGTCTAT | 50 | 18.3
KNSL1(EG5) | NM_004523 | SEQ. ID NO. 0375 | CAGCAGAAATCTAAGGATA | 41 | 32.7
KNSL1(EG5) | NM_004523 | SEQ. ID NO. 0376 | CTAGATGGCTTTCTCAGTA | 39 | 3.9


CyclophilinA | NM_021130 | SEQ. ID NO. 0377 | AGACAAGGTCCCAAAGACA | -16 | 58.1
CyclophilinA | NM_021130 | SEQ. ID NO. 0378 | GGAATGGCAAGACCAGCAA | -6 | 36
CyclophilinA | NM_021130 | SEQ. ID NO. 0379 | AGAATTATTCCAGGGTTTA | -3 | 16.1
CyclophilinA | NM_021130 | SEQ. ID NO. 0380 | GCAGACAAGGTCCCAAAGA | 8 | 8.9
LAMIN A/C | NM_170707 | SEQ. ID NO. 0381 | AGAAGCAGCTTCAGGATGA | 31 | 38.8
LAMIN A/C | NM_170707 | SEQ. ID NO. 0382 | GAGCTTGACTTCCAGAAGA | 33 | 22.4
LAMIN A/C | NM_170707 | SEQ. ID NO. 0383 | CCACCGAAGTTCACCCTAA | 21 | 27.5
LAMIN A/C | NM_170707 | SEQ. ID NO. 0384 | GAGAAGAGCTCCTCCATCA | 55 | 30.1
CyclophilinB | M60857 | SEQ. ID NO. 0385 | GAAAGAGCATCTACGGTGA | 41 | 83.9
CyclophilinB | M60857 | SEQ. ID NO. 0386 | GAAAGGATTTGGCTACAAA | 53 | 59.1
CyclophilinB | M60857 | SEQ. ID NO. 0387 | ACAGCAAATTCCATCGTGT | -20 | 28.8
CyclophilinB | M60857 | SEQ. ID NO. 0388 | GGAAAGACTGTTCCAAAAA | 2 | 27
DBI1 | NM_020548 | SEQ. ID NO. 0389 | CAACACGCCTCATCCTCTA | 27 | -7.6
DBI2 | NM_020548 | SEQ. ID NO. 0390 | CATGAAAGCTTACATCAAC | 25 | -30.8
DBI3 | NM_020548 | SEQ. ID NO. 0391 | AAGATGCCATGAAAGCTTA | 17 | 22
DBI4 | NM_020548 | SEQ. ID NO. 0392 | GCACATACCGCCTGAGTCT | 15 | 3.9
rLUC1 | | SEQ. ID NO. 0393 | GATCAAATCTGAAGAAGGA | 57 | 49.2
rLUC2 | | SEQ. ID NO. 0394 | GCCAAGAAGTTTCCTAATA | 50 | 13.7
rLUC3 | | SEQ. ID NO. 0395 | CAGCATATCTTGAACCATT | 41 | -2.2
rLUC4 | | SEQ. ID NO. 0396 | GAACAAAGGAAACGGATGA | 39 | 29.2
SeAP1 | NM_031313 | SEQ. ID NO. 0397 | CGGAAACGGTCCAGGCTAT | 6 | 26.9
SeAP2 | NM_031313 | SEQ. ID NO. 0398 | GCTTCGAGCAGACATGATA | 4 | -11.2


SeAP3 | NM_031313 | SEQ. ID NO. 0399 | CCTACACGGTCCTCCTATA | 4 | 4.9
SeAP4 | NM_031313 | SEQ. ID NO. 0400 | GCCAAGAACCTCATCATCT | 1 | -9.9
fLUC1 | | SEQ. ID NO. 0401 | GATATGGGCTGAATACAAA | 54 | 40.4
fLUC2 | | SEQ. ID NO. 0402 | GCACTCTGATTGACAAATA | 47 | 54.7
fLUC3 | | SEQ. ID NO. 0403 | TGAAGTCTCTGATTAAGTA | 46 | 34.5
fLUC4 | | SEQ. ID NO. 0404 | TCAGAGAGATCCTCATAAA | 40 | 11.4
mCyclo_1 | NM_008907 | SEQ. ID NO. 0405 | GCAAGAAGATCACCATTTC | 52 | 46.4
mCyclo_2 | NM_008907 | SEQ. ID NO. 0406 | GAGAGAAATTTGAGGATGA | 36 | 70.7
mCyclo_3 | NM_008907 | SEQ. ID NO. 0407 | GAAAGGATTTGGCTATAAG | 35 | -1.5
mCyclo_4 | NM_008907 | SEQ. ID NO. 0408 | GAAAGAAGGCATGAACATT | 27 | 10.3
BCL2_1 | NM_000633 | SEQ. ID NO. 0409 | GGGAGATAGTGATGAAGTA | 21 | 72
BCL2_2 | NM_000633 | SEQ. ID NO. 0410 | GAAGTACATCCATTATAAG | 1 | 3.3
BCL2_3 | NM_000633 | SEQ. ID NO. 0411 | GTACGACAACCGGGAGATA | 1 | 35.9
BCL2_4 | NM_000633 | SEQ. ID NO. 0412 | AGATAGTGATGAAGTACAT | -12 | 22.1
BCL2_5 | NM_000633 | SEQ. ID NO. 0413 | TGAAGACTCTGCTCAGTTT | 36 | 19.1
BCL2_6 | NM_000633 | SEQ. ID NO. 0414 | GCATGCGGCCTCTGTTTGA | 5 | -9.7


QB1 | NM_003365.1 | SEQ. ID NO. 0415 | GCACACAGCUUACUACAUC | 52 | -4.8
QB2 | NM_003365.1 | SEQ. ID NO. 0416 | GAAAUGCCCUGGUAUCUCA | 49 | 22.1
QB3 | NM_003365.1 | SEQ. ID NO. 0417 | GAAGGAACGUGAUGUGAUC | 34 | 22.9
QB4 | NM_003365.1 | SEQ. ID NO. 0418 | GCACUACUCCUGUGUGUGA | 28 | 20.4
ATE1-1 | NM_007041 | SEQ. ID NO. 0419 | GAACCCAGCUGGAGAACUU | 45 | 15.5
ATE1-2 | NM_007041 | SEQ. ID NO. 0420 | GAUAUACAGUGUGAUCUUA | 40 | 12.2
ATE1-3 | NM_007041 | SEQ. ID NO. 0421 | GUACUACGAUCCUGAUUAU | 37 | 32.9
ATE1-4 | NM_007041 | SEQ. ID NO. 0422 | GUGCCGACCUUUACAAUUU | 35 | 18.2
EGFR-1 | NM_005228 | SEQ. ID NO. 0423 | GAAGGAAACTGAATTCAAA | 68 | 19 A
EGFR-1 | NM_005228 | SEQ. ID NO. 0424 | GGAAATATGTACTACGAAA | 49 | 49.5
EGFR-1 | NM_005228 | SEQ. ID NO. 0425 | CCACAAAGCAGTGAATTTA | 41 | 7.6
EGFR-1 | NM_005228 | SEQ. ID NO. 0426 | GTAACAAGCTCACGCAGTT | 40 | 25.9



Example VII. Genome-Wide Application of the Algorithm 

The examples described above demonstrate that the algorithm(s) can
successfully identify functional siRNA and that these duplexes can be used to induce
the desirable phenotype of transcriptional knockdown or knockout. Each gene or
family of genes in each organism plays an important role in maintaining physiological
homeostasis, and the algorithm can be used to develop functional, highly functional, or
hyperfunctional siRNA to each gene. To accomplish this for the human genome, the
entire online NCBI RefSeq database was accessed through Entrez (efetch). The database
was processed through Formula VIII. For each gene the top 80-100 scores for
siRNAs were obtained and BLAST'ed to ensure that the selected sequences are
specific in targeting the gene of choice. These sequences are provided on the
enclosed CDs in electronic form. Accordingly, Applicants hereby incorporate by
reference the material submitted herewith, in duplicate on the compact disks labeled
COPY 1 - TABLES PART, DISK 1/1, TABLES 12-15, Filed with RO/US under
PCT AI sec. 801(a), Operating System: MS-Windows; COPY 2 - TABLES PART,
DISK 1/1, TABLES 12-15, Filed with RO/US under PCT AI sec. 801(a), Operating
System: MS-Windows; and COPY 3 - TABLES PART, DISK 1/1, TABLES 12-15,
Filed with RO/US under PCT AI sec. 801(a), Operating System: MS-Windows;
which copies are identical, in files entitled Table_12.txt, date of creation June 26,
2003, with a size of 31,045 kb; Table_13.txt, date of creation November 13, 2003,
with a size of 78,451 kb; Table_14.txt, date of creation November 13, 2003, with a
size of 454 kb; and Table_15.txt, date of creation November 13, 2003, with a size of
1,690 kb.
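
The selection step described above (score every candidate target, keep the best 80-100 per gene) can be sketched as follows. This is only an illustration: Formula VIII is not reproduced here, so `placeholder_score` is a hypothetical stand-in, and the function names, the 19-base window, and the default of 100 candidates are assumptions rather than the Applicants' implementation. The subsequent BLAST specificity check is not modeled.

```python
from typing import Callable, List, Tuple

Candidate = Tuple[int, str, float]  # (start position, target site, score)


def placeholder_score(site: str) -> float:
    """Hypothetical stand-in for Formula VIII: simply counts A/U bases."""
    return float(sum(base in "AU" for base in site))


def top_candidates(mrna: str,
                   score: Callable[[str], float] = placeholder_score,
                   length: int = 19,
                   keep: int = 100) -> List[Candidate]:
    """Score every window of `length` bases and return the `keep` best."""
    mrna = mrna.upper().replace("T", "U")
    scored = []
    for start in range(len(mrna) - length + 1):
        site = mrna[start:start + length]
        scored.append((start, site, score(site)))
    # Stable sort: sites with equal scores keep their 5'-to-3' order.
    scored.sort(key=lambda candidate: candidate[2], reverse=True)
    return scored[:keep]
```

Passing a real scoring function as `score` would replace the placeholder without changing the ranking logic.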

With respect to the disks, there are four tables on each disk copy in text
format: Tables XII-XV. Table XII, which is located in a file entitled Table_12.txt,
provides a list of the 80-100 sequences for each target, identified by Formula VIII as
having the highest relative SMARTscores™ for the target analyzed. Table XIII,
which is located in a file entitled Table_13.txt, provides the SMARTscores™, and for
each gene, a pool pick of up to four sequences is denoted. (The denotation of "1" in
Table XIII means that it is a pool pick.) These pool pick sequences represent the most
functional siRNAs for the corresponding target. Any 1, 2, 3, or 4 of the pool pick
sequences could be used for gene silencing. Further, sequences that are not denoted
as pool pick sequences, but that are included on the compact disks, may also be used
for gene silencing either alone or in combination with other sequences. However,
their individual relative functionality would be less than that of a pool pick sequence.
Table XIV, which is located in a file entitled Table_14.txt, provides an identification
of genes by accession number, and Table XV, which is located in a file entitled
Table_15.txt, provides a short name for the genes identified on the disk. The
information contained on the disks is part of this patent application and is
incorporated into the specification by reference. One may use these tables to
identify functional siRNAs for a gene provided therein by simply looking for the
gene of interest and an siRNA that is listed as functional. Preferably, one would
select one or more of the siRNA that are most optimized for the target of interest and
are denoted as pool picks.
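
As a rough illustration of the lookup just described, the sketch below collects up to four pool pick sequences per gene. The actual layout of Table_13.txt is not given in this text, so the row shape (gene, sequence, score, flag) and the function name `pool_picks` are assumptions; only the meaning of the flag value 1 and the limit of four picks come from the description above.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Assumed row shape: (gene, siRNA sequence, SMARTscore, pool-pick flag).
Row = Tuple[str, str, float, int]


def pool_picks(rows: Iterable[Row], max_per_gene: int = 4) -> Dict[str, List[str]]:
    """Return, for each gene, the sequences flagged "1" (at most four)."""
    picks: Dict[str, List[str]] = defaultdict(list)
    for gene, sequence, _score, flag in rows:
        if flag == 1 and len(picks[gene]) < max_per_gene:
            picks[gene].append(sequence)
    return dict(picks)
```

Sequences that are listed but not flagged remain available in `rows`; they are simply not returned as pool picks.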



Table XII: siRNA selected by Formula VIII

See data submitted herewith on a CD-ROM in accordance with PCT
Administrative Instructions Section 801(a)

Table XIII: SMARTscores™

See data submitted herewith on a CD-ROM in accordance with PCT
Administrative Instructions Section 801(a)

Table XIV: Identification of Targets

See data submitted herewith on a CD-ROM in accordance with PCT
Administrative Instructions Section 801(a)

Table XV: Description of Targets

See data submitted herewith on a CD-ROM in accordance with PCT
Administrative Instructions Section 801(a)

Many of the genes to which the described siRNA are directed play critical
roles in disease etiology. For this reason, the siRNA listed in the accompanying
compact disk may potentially act as therapeutic agents. A number of prophetic
examples follow and should be understood in view of the siRNA that are identified on
the accompanying CD. To isolate these siRNA, the appropriate message sequence for
each gene is analyzed using one of the aforementioned formulas (preferably Formula
VIII) to identify potential siRNA targets. Subsequently, these targets are BLAST'ed to
eliminate homology with potential off-targets.

The list of potential disease targets is extensive. For instance, over-expression
of Bcl10 has been implicated in the development of MALT lymphoma (mucosa
associated lymphoid tissue lymphoma) and thus, functional, highly functional, or
hyperfunctional siRNA directed against that gene (e.g. SEQ. ID NO. 0427:
GGAAACCUCUCAUUGCUAA; SEQ. ID NO. 0428:
GAAAGAACCUUGCCGAUCA; SEQ. ID NO. 0429:
GGAAAUACAUCAGAGCUUA; or SEQ. ID NO. 0430:
GAAAGUAUGUGUCUUAAGU) may contribute to treatment of this disorder.

In another example, studies have shown that molecules that inhibit
glutamine:fructose-6-phosphate aminotransferase (GFA) may act to limit the
symptoms suffered by Type II diabetics. Thus, functional, highly functional, or
hyperfunctional siRNA directed against GFA (also known as GFPT1: siRNA = SEQ.
ID NO. 0433: UGAAACGGCUGCCUGAUUU; SEQ. ID NO. 0434:
GAAGUUACCUCUUACAUUU; SEQ. ID NO. 0435:
GUACGAAACUGUAUGAUUA; SEQ. ID NO. 0436:
GGACGAGGCUAUCAUUAUG) may contribute to treatment of this disorder.

In another example, the von Hippel-Lindau (VHL) tumor suppressor has been
observed to be inactivated at a high frequency in sporadic clear cell renal cell
carcinoma (RCC) and RCCs associated with VHL disease. The VHL tumor
suppressor targets hypoxia-inducible factor-1 alpha (HIF-1 alpha), a transcription
factor that can induce vascular endothelial growth factor (VEGF) expression, for
ubiquitination and degradation. Inactivation of VHL can lead to increased levels of
HIF-1 alpha, and subsequent VEGF over-expression. Such over-expression of VEGF
has been used to explain the increased (and possibly necessary) vascularity observed
in RCC. Thus, functional, highly functional, or hyperfunctional siRNAs directed
against either HIF-1 alpha (SEQ. ID NO. 0437: GAAGGAACGp3AUGCUUUA;
SEQ. ID NO. 0438: GCAUAUAUCUAGAAGGUAU; SEQ. ID NO. 0439:
GAACAAAUACAUGGGAUUA; SEQ. ID NO. 0440:
GGACACAGAUUUAGACUUG) or VEGF (SEQ. ID NO. 0441:
GAACGUACUUGCAGAUGUG; SEQ. ID NO. 0442:
GAGAAAGCAUUUGUUUGUA; SEQ. ID NO. 0443:
GGAGAAAGCAUUUGUUUGU; SEQ. ID NO. 0444:
CGAGGCAGCUUGAGUUAAA) may be useful in the treatment of renal cell
carcinoma.

In another example, gene expression of platelet derived growth factor A and B
(PDGF-A and PDGF-B) has been observed to be increased 22- and 6-fold,
respectively, in renal tissues taken from patients with diabetic nephropathy as
compared with controls. These findings suggest that over-expression of PDGF A and
B may play a role in the development of the progressive fibrosis that characterizes
human diabetic kidney disease. Thus, functional, highly functional, or hyperfunctional
siRNAs directed against either PDGF A

(SEQ. ID NO. 0445: GGUAAGAUAUUGUGCUUUA;
SEQ. ID NO. 0446: CCGCAAAUAUGCAGAAUUA;
SEQ. ID NO. 0447: GGAUGUACAUGGCGUGUUA;
SEQ. ID NO. 0448: GGUGAAGUUUGUAUGUUUA) or

PDGF B

(SEQ. ID NO. 0449: CCGAGGAGCUUUAUGAGAU;
SEQ. ID NO. 0450: GCUCCGCGCUUUCCGAUUU;
SEQ. ID NO. 0451: GAGCAGGAAUGGUGAGAUG;
SEQ. ID NO. 0452: GAACUUGGGAUAAGAGUGU;
SEQ. ID NO. 0453: CCGAGGAGCUUUAUGAGAU;
SEQ. ID NO. 0454: UUUAUGAGAUGCUGAGUGA) may be useful in the treatment
of this form of kidney disorder.

In another example, a strong correlation exists between the over-expression of
glucose transporters (e.g. GLUT12) and cancer cells. It is predicted that cells
undergoing uncontrolled cell growth up-regulate GLUT molecules so that they can
cope with the heightened energy needs associated with increased rates of proliferation
and metastasis. Thus, siRNA-based therapies that target molecules such as
GLUT1 (also known as SLC2A1: siRNA =
SEQ. ID NO. 0455: GCAAUGAUGUCCAGAAGAA;
SEQ. ID NO. 0456: GAAGAAUAUUCAGGACUUA;
SEQ. ID NO. 0457: GAAGAGAGUCGGCAGAUGA;
SEQ. ID NO. 0458: CCAAGAGUGUGCUAAAGAA),

GLUT12 (also known as SLC2A12: siRNA =
SEQ. ID NO. 0459: GAGACACUCUGAAAUGAUA;
SEQ. ID NO. 0460: GAAAUGAUGUGGAUAAGAG;
SEQ. ID NO. 0461: GAUCAAAUCCUCCCUGAAA;
SEQ. ID NO. 0462: UGAAUGAGCUGAUGAUUGU), and other related transporters
may be of value in treating a multitude of malignancies.

The siRNA sequences listed above are presented in a 5' → 3' sense strand
direction. In addition, siRNA directed against the targets listed above as well as those
directed against other targets and listed in the accompanying compact disk may be
useful as therapeutic agents.
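
Because the sequences are listed as sense strands, the complementary antisense strand can be derived by reverse complementation. The following is a minimal sketch under stated assumptions: the helper name `antisense` is hypothetical, sequences written in the DNA alphabet are converted to RNA first, and overhangs or chemical modifications are not modeled.

```python
# Base-pairing rules for RNA (A pairs with U, C pairs with G).
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense(sense: str) -> str:
    """Return the antisense strand, written 5' to 3', for a sense strand."""
    rna = sense.upper().replace("T", "U")
    if set(rna) - set("ACGU"):
        raise ValueError("sequence contains characters other than A, C, G, U/T")
    return rna.translate(_COMPLEMENT)[::-1]
```

For example, applying `antisense` to SEQ. ID NO. 0427 (GGAAACCUCUCAUUGCUAA) gives UUAGCAAUGAGAGGUUUCC.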

Example VIII. Evidence for the Benefits of Pooling 

Evidence for the benefits of pooling has been demonstrated using the
reporter gene, luciferase. Ninety siRNA duplexes were synthesized using Dharmacon
proprietary ACE® chemistry against one of the standard reporter genes: firefly
luciferase. The duplexes were designed to start two base pairs apart and to cover
approximately 180 base pairs of the luciferase gene (see sequences in Table III).
Subsequently, the siRNA duplexes were co-transfected with a luciferase expression
reporter plasmid into HEK293 cells using standard transfection protocols and
luciferase activity was assayed at 24 and 48 hours.
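
The tiling design described above can be sketched as below. The function name `tile_duplex_targets` is hypothetical, and the 19-base site length is an assumption taken from the length of the sequences listed in this document. Under that assumption, ninety start positions spaced two bases apart span 178 bases, so the last site ends 197 bases after the first begins, in line with the stated coverage of approximately 180 base pairs.

```python
from typing import List, Tuple


def tile_duplex_targets(region: str,
                        length: int = 19,
                        step: int = 2,
                        count: int = 90) -> List[Tuple[int, str]]:
    """Return up to `count` (start, site) pairs, each shifted by `step` bases."""
    targets = []
    for k in range(count):
        start = k * step
        site = region[start:start + length]
        if len(site) < length:  # ran off the end of the supplied region
            break
        targets.append((start, site))
    return targets
```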

Transfection of individual siRNAs showed a standard distribution of inhibitory
effect. Some duplexes were active, while others were not. Figure 15 represents a
typical screen of ninety siRNA duplexes (SEQ. ID NO. 0032-0120) positioned two
base pairs apart. As the figure suggests, the functionality of the siRNA duplex is
determined more by a particular sequence of the oligonucleotide than by the relative
oligonucleotide position within a gene or excessively sensitive part of the mRNA,
which is important for traditional anti-sense technology.

When two continuous oligonucleotides were pooled together, a significant
increase in gene silencing activity was observed. (See Figure 16.) A gradual increase
in efficacy and in the frequency of functional pools was observed when the number of
siRNAs increased to 3 and 4. (Figures 16, 17.) Further, the relative positioning of
the oligonucleotides within a pool did not determine whether a particular pool was
functional (see Figure 18, in which 100% of pools of oligonucleotides distanced by 2,
10 and 20 base pairs were functional).

However, relative positioning may nonetheless have an impact. An increased
functionality may exist when the siRNA are positioned continuously head to tail (5'
end of one directly adjacent to the 3' end of the others).
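
The pooling schemes discussed above (neighbouring duplexes, or duplexes separated by a fixed distance along the walk) can be enumerated with a short helper. This is a sketch only: `make_pools` and its `stride` parameter are hypothetical names, and `stride` counts positions in the ordered duplex list rather than base pairs.

```python
from typing import List, Sequence


def make_pools(duplexes: Sequence[str], size: int, stride: int = 1) -> List[List[str]]:
    """Group ordered duplexes into pools of `size`, members `stride` apart."""
    if size < 1 or stride < 1:
        raise ValueError("size and stride must be positive")
    span = (size - 1) * stride  # index distance from first to last member
    return [list(duplexes[i:i + span + 1:stride])
            for i in range(len(duplexes) - span)]
```

With duplexes that start two base pairs apart, `stride=1` pools immediate neighbours, while `stride=5` and `stride=10` pool duplexes whose start sites are 10 and 20 base pairs apart.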

Additionally, siRNA pools that were tested performed at least as well as the
best oligonucleotide in the pool, under the experimental conditions whose results are
depicted in Figure 19. Moreover, when previously identified non-functional and
marginally (semi) functional siRNA duplexes were pooled together in groups of five
at a time, a significant functional cooperative action was observed. (See Figure 20.)
In fact, pools of semi-active oligonucleotides were 5 to 25 times more functional than
the most potent oligonucleotide in the pool. Therefore, pooling several siRNA
duplexes together does not interfere with the functionality of the most potent siRNAs
within a pool, and pooling provides an unexpected significant increase in overall
functionality.

Example IX. Pooling Across Species 

Experiments were performed on the following genes: β-galactosidase, Renilla
luciferase, and secreted alkaline phosphatase, demonstrating the benefits of
pooling (see Figure 21). Approximately 50% of individual siRNAs designed to
silence the above-specified genes were functional, while 100% of the pools that
contain the same siRNA duplexes were functional.

Example X. Highly Functional siRNA

Pools of five siRNAs in which each two siRNAs overlap by 10-90% resulted
in 98% functional entities (>80% silencing). Pools of siRNAs distributed throughout
the mRNA that were evenly spaced, covering an approximate 20-2000 base pair
range, were also functional. When the pools of siRNA were positioned continuously
head to tail relative to mRNA sequences and mimicked the natural products of Dicer
cleaved long double stranded RNA, 98% of the pools evidenced highly functional
activity (>95% silencing).
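
The two pool geometries described above, overlapping sites and sites placed head to tail, differ only in how far each site is shifted from the previous one. The sketch below makes that explicit; `pool_sites` and its parameters are hypothetical, and the 19-base site length is assumed from the sequences listed in this document.

```python
from typing import List


def pool_sites(mrna: str, start: int, count: int = 5,
               length: int = 19, overlap_nt: int = 0) -> List[str]:
    """Return `count` target sites; neighbours share `overlap_nt` bases.

    overlap_nt=0 places the sites head to tail with no gap and no overlap.
    """
    if count < 1 or not 0 <= overlap_nt < length:
        raise ValueError("need count >= 1 and 0 <= overlap_nt < length")
    step = length - overlap_nt
    sites = [mrna[start + k * step:start + k * step + length]
             for k in range(count)]
    if len(sites[-1]) < length:
        raise ValueError("message too short for the requested pool")
    return sites
```

For a 19-base site, an overlap of 2 to 17 bases corresponds roughly to the 10-90% range mentioned above.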



Example XI. Human Cyclophilin

Table III above lists the siRNA sequences for the human cyclophilin
protein. A particularly functional siRNA may be selected by applying these
sequences to any of Formulas I to VII above.

Alternatively, one could pool 2, 3, 4, 5 or more of these sequences to create a
kit for silencing a gene. Preferably, within the kit there would be at least one
sequence that has a relatively high predicted functionality when any of Formulas I -
VII is applied.

Example XII. Sample Pools of siRNAs and Their Application to Human Disease

The genetic basis behind human disease is well documented and siRNA may
be used as research or diagnostic tools and as therapeutic agents, either individually
or in pools. Genes involved in signal transduction, the immune response, apoptosis,
DNA repair, cell cycle control, and a variety of other physiological functions have
clinical relevance, and therapeutic agents that can modulate expression of these genes
may alleviate some or all of the associated symptoms. In some instances, these genes
can be described as members of a family or class of genes, and siRNA (randomly,
conventionally, or rationally designed) can be directed against one or multiple
members of the family to induce a desired result.

To identify rationally designed siRNA to each gene, the sequence was
analyzed using Formula VIII to identify a SMARTpool containing the functional
sequences. To confirm the activity of these sequences, the siRNA are introduced into
a cell type of choice (e.g. HeLa cells, HEK293 cells) and the levels of the appropriate
message are analyzed using one of several art proven techniques. siRNA having
heightened levels of potency can be identified by testing each of the aforementioned
duplexes at increasingly limiting concentrations. Similarly, siRNA having increased
levels of longevity can be identified by introducing each duplex into cells and testing
functionality at 24, 48, 72, 96, 120, 144, 168, and 192 hours after transfection. Agents
that induce >95% silencing at sub-nanomolar concentrations and/or induce functional
levels of silencing for >96 hours are considered hyperfunctional.
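
The potency and longevity criteria above can be expressed as a small classifier. This is a sketch under stated assumptions: the function names are hypothetical, "functional" is taken to mean more than 80% silencing and "highly functional" more than 95% silencing (the thresholds given parenthetically in Example X), and "sub-nanomolar" is taken to mean below 1 nM.

```python
from typing import Mapping


def functional_duration(timecourse: Mapping[int, float],
                        threshold: float = 80.0) -> int:
    """Latest assay time (hours) up to which silencing stayed above threshold."""
    last = 0
    for hours in sorted(timecourse):
        if timecourse[hours] <= threshold:
            break
        last = hours
    return last


def classify(silencing_pct: float, conc_nM: float,
             timecourse: Mapping[int, float]) -> str:
    """Classify a duplex from its silencing, dose, and time-course data."""
    potent = silencing_pct > 95.0 and conc_nM < 1.0
    durable = functional_duration(timecourse) > 96
    if potent or durable:
        return "hyperfunctional"
    if silencing_pct > 95.0:
        return "highly functional"
    if silencing_pct > 80.0:
        return "functional"
    return "not functional"
```

Here `timecourse` maps hours after transfection (for example 24, 48, ..., 192) to percent silencing measured at that time.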

The following are non-limiting examples of families of proteins against which the
siRNA described in this document are targeted:

Transporters, Pumps, and Channels

Transporters, pumps, and channels represent one class of genes that are
attractive targets for siRNAs. One major class of transporter molecules is the
ATP-binding cassette (ABC) transporters. To date, nearly 50 human ABC-transporter
genes have been characterized and have been shown to be involved in a variety of
physiological functions including transport of bile salts, nucleosides, chloride ions,
cholesterol, toxins, and more. Predominant among this group are MDR1 (which
encodes the P-glycoprotein, NP_000918), the MDR-related proteins (MRP1-7), and
the breast cancer resistance protein (BCRP). In general, these transporters share a
common structure, with each protein containing a pair of ATP-binding domains (also
known as nucleotide binding folds, NBF) and two sets of transmembrane (TM)
domains, each of which typically contains six membrane-spanning α-helices. The
genes encoding this class of transporter are organized as either full transporters (i.e.
containing two TM and two NBF domains) or as half transporters that assemble as
either homodimers or heterodimers to create functional transporters. As a whole,
members of the family are widely dispersed throughout the genome and show a high
degree of amino acid sequence identity among eukaryotes.

ABC-transporters have been implicated in several human diseases. For
instance, molecular efflux pumps of this type play a major role in the development of
drug resistance exhibited by a variety of cancers and pathogenic microorganisms. In
the case of human cancers, increased expression of the MDR1 gene and related pumps
has been observed to generate drug resistance to a broad collection of commonly
used chemotherapeutics including doxorubicin, daunorubicin, vinblastine, vincristine,
and colchicine. In addition to the contribution these transporters make to the
development of multi-drug resistance, there are currently 13 human genetic diseases
associated with defects in 14 different transporters. The most common of these
conditions include cystic fibrosis, Stargardt disease, age-related macular degeneration,
adrenoleukodystrophy, Tangier disease, Dubin-Johnson syndrome and progressive
familial intrahepatic cholestasis. For this reason, siRNAs directed against members of
this, and related, families are potentially valuable research and therapeutic tools.



With respect to channels, analysis of Drosophila mutants has enabled the
initial molecular isolation and characterization of several distinct channels including
(but not limited to) potassium (K+) channels. This list includes shaker (Sh), which
encodes a voltage activated K+ channel, slowpoke (Slo), a Ca2+ activated K+ channel,
and ether-a-go-go (Eag). The Eag family is further divided into three subfamilies:
Eag, Elk (eag-like K channels), and Erg (Eag related genes).

The Erg subfamily contains three separate family members (Erg1-3) that are
distantly related to the sh family of voltage activated K+ channels. Like sh, erg
polypeptides contain the classic six membrane spanning architecture of K+ channels
(S1-S6) but differ in that each includes a segment associated with the C-terminal
cytoplasmic region that is homologous to cyclic nucleotide binding domains (cNBD).
Like many isolated ion channel mutants, erg mutants are temperature-sensitive
paralytics, a phenotype caused by spontaneous repetitive firing (hyperactivity) in
neurons and enhanced transmitter release at the neuromuscular junction.

Initial studies on the tissue distribution of all three members of the erg
subfamily show two general patterns of expression. Erg1 and erg3 are broadly
expressed throughout the nervous system and are observed in the heart, the superior
mesenteric ganglia, the celiac ganglia, the retina, and the brain. In contrast, erg2
shows a much more restricted pattern of expression and is only observed in celiac
ganglia and superior mesenteric ganglia. Similarly, the kinetic properties of the three
erg potassium channels are not homogeneous. Erg1 and erg2 channels are relatively
slow activating delayed rectifiers whereas the erg3 current activates rapidly and then
exhibits a predominantly transient component that decays to a sustained plateau. The
current properties of all three channels are sensitive to methanesulfonanilides,
suggesting a high degree of conservation in the pore structure of all three proteins.

Recently, the erg family of K+ channels has been implicated in human disease.
Consistent with the observation that erg1 is expressed in the heart, single strand
conformation polymorphism and DNA sequence analyses have identified HERG
(human erg1) mutations in six families with long-QT syndrome (LQT), an inherited
disorder that results in sudden death from a ventricular tachyarrhythmia. Thus siRNA
directed against this group of molecules (e.g. KCNH1-8) will be of extreme
therapeutic value.



Another group of channels that are potential targets of siRNAs is
the CLCA family, which mediates a Ca2+-activated Cl- conductance in a variety of
tissues. To date, two bovine (bCLCA1; bCLCA2 (Lu-ECAM-1)), three mouse
(mCLCA1; mCLCA2; mCLCA3) and four human (hCLCA1; hCLCA2; hCLCA3;
hCLCA4) CLCA family members have been isolated, and patch-clamp studies with
transfected human embryonic kidney (HEK-293) cells have shown that bCLCA1,
mCLCA1, and hCLCA1 mediate a Ca2+-activated Cl- conductance that can be
inhibited by the anion channel blocker DIDS and the reducing agent dithiothreitol
(DTT).

The protein size, structure, and processing seem to be similar among different
CLCA family members and have been studied in greatest detail for Lu-ECAM-1. The
Lu-ECAM-1 open reading frame encodes a precursor glycoprotein of 130 kDa that is
processed to a 90-kDa amino-terminal cleavage product and a group of 30- to 40-kDa
glycoproteins that are glycosylation variants of a single polypeptide derived from its
carboxy terminus. Both subunits are associated with the outer cell surface, but only
the 90-kDa subunit is thought to be anchored to the cell membrane via four
transmembrane domains.



Although the protein processing and function appear to be conserved among
CLCA homologs, significant differences exist in their tissue expression patterns. For
example, bovine Lu-ECAM-1 is expressed primarily in vascular endothelia, bCLCA1
is exclusively detected in the trachea, and hCLCA1 is selectively expressed in a
subset of human intestinal epithelial cells. Thus the emerging picture is that of a
multigene family with members that are highly tissue specific, similar to the ClC
family of voltage-gated Cl- channels. The human channel, hCLCA2, is particularly
interesting from a medical and pharmacological standpoint. CLCA2 is expressed on
the luminal surface of lung vascular endothelia and serves as an adhesion molecule
for lung metastatic cancer cells, thus mediating vascular arrest and lung colonization.
Expression of this molecule in normal mammary epithelium is consistently lost in
human breast cancer and in nearly all tumorigenic breast cancer cell lines. Moreover,
re-expression of hCLCA2 in human breast cancer cells abrogates tumorigenicity in
nude mice, implying that hCLCA2 acts as a tumour suppressor in breast cancer. For
these reasons, siRNA directed against CLCA family members and related channels
may prove to be valuable in research and therapeutic venues.

Transporters Involved in Synaptic Transmission. 

Synaptic transmission involves the release of a neurotransmitter into the 
synaptic cleft, interaction of that transmitter with a postsynaptic receptor, and 
subsequent removal of the transmitter from the cleft. In most synapses the signal is 
terminated by a rapid reaccumulation of the neurotransmitter into presynaptic 
terminals. This process is catalyzed by specific neurotransmitter transporters that are 
often energized by the electrochemical gradient of sodium across the plasma 
membrane of the presynaptic cells. 



γ-Aminobutyric acid (GABA) is the major inhibitory neurotransmitter in the
central nervous system. The inhibitory action of GABA, mediated through GABA-A and
GABA-B receptors, is regulated by GABA transporters (GATs), integral
membrane proteins located perisynaptically on neurons and glia. So far four different
carriers (GAT1-GAT4) have been cloned and their cellular distribution has been 
partly worked out. Comparative sequence analysis has revealed that GABA 
transporters are related to several other proteins involved in neurotransmitter uptake 
including gamma-aminobutyric acid transporters, monoamine transporters, amino 
acid transporters, certain "orphan" transporters, and the recently discovered bacterial 
transporters. Each of these proteins has a similar 12 transmembrane helices topology 
and relies upon the Na+/Cl- gradient for transport function. Transport rates are 
dependent on substrate concentrations, with half-maximal effective concentrations for 
transport frequently occurring in the submicromolar to low micromolar range. In 
addition, transporter function is bidirectional, and non-vesicular efflux of transmitter 
may contribute to ambient extracellular transmitter levels. 

Recent evidence suggests that GABA transporters, and neurotransmitter 
transporters in general, are not passive players in regulating neuronal signaling; rather, 
transporter function can be altered by a variety of initiating factors and signal 
transduction cascades. In general, this functional regulation occurs in two ways,
either by changing the rate of transmitter flux through the transporter or by changing
the number of functional transporters on the plasma membrane. A recurring theme in 
transporter regulation is the rapid redistribution of the transporter protein between 
intracellular locations and the cell surface. In general, this functional modulation 
occurs in part through activation of second messengers such as kinases, phosphatases, 
arachidonic acid, and pH. However, the mechanisms underlying transporter 
phosphorylation and transporter redistribution have yet to be fully elucidated. 

GABA transporters play a pathophysiological role in a number of human 
diseases including temporal lobe epilepsy and are the targets of pharmacological 
interventions. Studies in seizure sensitive animals show some (but not all) of the GAT 
transporters have altered levels of expression at times prior to and post seizure, 
suggesting this class of transporter may affect epileptogenesis, and that alterations 
following seizure may be compensatory responses to modulate seizure activity. For 
these reasons, siRNAs directed against members of this family of genes (including but
not limited to SLC6A1-12) may prove to be valuable research and therapeutic tools.

Organic Ion Transporters. 

The human body is continuously exposed to a great variety of xenobiotics, via 
food, drugs, occupation, and environment. Excretory organs such as kidney, liver, and 
intestine defend the body against the potentially harmful effects of these compounds 
by transforming them into less active metabolites that are subsequently secreted from 
the system. 

Carrier-mediated transport systems for xenobiotics and their metabolites exist for the
active secretion of organic anions and cations. Both systems are characterized by a
high clearance capacity and tremendous diversity of substances accepted, properties
that result from the existence of multiple transporters with overlapping substrate
specificities. The class of organic anion transporters plays a critical role in the 
elimination of a large number of drugs (e.g., antibiotics, chemotherapeutics, diuretics, 
nonsteroidal anti-inflammatory drugs, radiocontrast agents, cytostatics); drug 
metabolites (especially conjugation products with glutathione, glucuronide, glycine, 
sulfate, acetate); and toxicants and their metabolites (e.g., mycotoxins, herbicides, 
plasticizers, glutathione S-conjugates of polyhaloalkanes, polyhaloalkenes,
hydroquinones, aminophenols), many of which are specifically harmful to the kidney. 

Over the past couple of years the number of identified anion transporting
molecules has grown tremendously. Uptake of organic anions (OA⁻) across the
basolateral membrane is mediated by the classic sodium-dependent organic anion
transport system, which includes α-ketoglutarate (α-KG²⁻)/OA⁻ exchange via the
organic anion transporter (OAT1) and sodium-ketoglutarate cotransport via the
Na⁺/dicarboxylate cotransporter (SDCT2). The organic anion transporting polypeptide,
Oatp1, and the kidney-specific OAT-K1 and OAT-K2 are seen as potential molecules
that mediate facilitated OA⁻ efflux but could also be involved in reabsorption via an
exchange mechanism. Lastly, PEPT1 and PEPT2 mediate luminal uptake of
peptide drugs, whereas CNT1 and CNT2 are involved in reabsorption of nucleosides.

The organic anion-transporting polypeptide 1 (Oatp1) is a Na⁺- and ATP-independent
transporter originally cloned from rat liver. The tissue distribution and
transport properties of the Oatp1 gene product are complex. Oatp1 is localized to the
basolateral membrane of hepatocytes, and is found on the apical membrane of S3
proximal tubules. Studies with transiently transfected cells (e.g. HeLa cells) have
indicated that Oatp1 mediates transport of a variety of molecules including
taurocholate, estrone-3-sulfate, aldosterone, cortisol, and others. The observed uptake
of taurocholate by Oatp1 expressed in X. laevis oocytes is accompanied by efflux of
GSH, suggesting that transport by this molecule may be glutathione dependent.

Computer modeling suggests that members of the Oatp family are highly 
conserved, hydrophobic, and have 12 transmembrane domains. Decreases in 
expression of Oatp family members have been associated with cholestatic liver 
diseases and human hepatoblastomas, making this family of proteins of key interest to 
researchers and the medical community. For these reasons, siRNAs directed against 
OAT family members (including but not limited to SLC21A2, 3, 6, 8, 9, 11, 12, 14,
15, and related transporters) are potentially useful as research and therapeutic tools. 



Nucleoside transporters. 

Nucleoside transporters play key roles in physiology and pharmacology.
Uptake of exogenous nucleosides is a critical first step of nucleotide synthesis in 
tissues such as bone marrow and intestinal epithelium and certain parasitic organisms 
that lack de novo pathways for purine biosynthesis. Nucleoside transporters also 
control the extracellular concentration of adenosine in the vicinity of its cell surface
receptors and regulate processes such as neurotransmission and cardiovascular 
activity. Adenosine itself is used clinically to treat cardiac arrhythmias, and nucleoside 
transport inhibitors such as dipyridamole, dilazep, and draflazine function as coronary 
vasodilators. 

In mammals, plasma membrane transport of nucleosides is brought about by
members of the concentrative, Na⁺-dependent (CNT) and equilibrative, Na⁺-independent
(ENT) nucleoside transporter families. CNTs are expressed in a tissue-specific
fashion; ENTs are present in most, possibly all, cell types and are responsible
for the movement of hydrophilic nucleosides and nucleoside analogs down their
concentration gradients. In addition, structure/function studies of ENT family
members have predicted these molecules to contain eleven transmembrane helical
segments with an amino terminus that is intracellular and a carboxyl terminus that is
extracellular. The proteins have a large glycosylated loop between TMs 1 and 2 and
a large cytoplasmic loop between TMs 6 and 7. Recent investigations have implicated
the TM 3-6 region as playing a central role in solute recognition. The medical
importance of the ENT family of proteins is broad. In humans, adenosine exerts a
range of cardioprotective effects, and inhibitors of ENTs are seen as being valuable in
alleviating a variety of cardiac and cardiovascular ailments. In addition, responses to
nucleoside analog drugs have been observed to vary considerably among, e.g., cancer
patients. While some forms of drug resistance have been shown to be tied to the
up-regulation of ABC-transporters (e.g. MDR1), resistance may also be the result of
reduced drug uptake (i.e. reduced ENT expression). Thus, a clearer understanding of
ENT transporters may aid in optimizing drug treatments for patients suffering a wide
range of malignancies. For these reasons, siRNAs directed against this class of
molecules (including SLC28A1-3, SLC29A1-4, and related molecules) may be useful
as therapeutic and research tools.



Sulfate Transporters. 

All cells require inorganic sulfate for normal function. Sulfate is the fourth
most abundant anion in human plasma and is the major source of sulfur in many
organisms. Sulfation of extracellular matrix proteins is critical for maintaining
normal cartilage metabolism, and sulfate is an important constituent of myelin
membranes found in the brain.



Because sulfate is a hydrophilic anion that cannot passively cross the lipid
bilayer of cell membranes, all cells require a mechanism for sulfate influx and efflux
to ensure an optimal supply. To date, a variety of sulfate transporters have been
identified in tissues from many origins. These include the renal sulfate transporters
(NaSi-1 and Sat-1), the ubiquitously expressed diastrophic dysplasia sulfate
transporter (DTDST), the intestinal sulfate transporter (DRA), and the erythrocyte
anion exchanger (AE1). Most, if not all, of these molecules contain the classic 12
transmembrane spanning domain architecture commonly found amongst members of
the anion transporter superfamily.

Recently three different sulfate transporters have been associated with specific
human genetic diseases. Family members SLC26A2, SLC26A3, and SLC26A4 have
been recognized as the disease genes mutated in diastrophic dysplasia, congenital
chloride diarrhea (CLD), and Pendred syndrome (PDS), respectively. DTDST is a
particularly complex disorder. The gene encoding this molecule maps to chromosome
5q, and encodes two distinct transcripts due to alternative exon usage. In contrast to
other sulfate transporters (e.g. Sat-1), anion movement by the DTDST protein is
markedly inhibited by either extracellular chloride or bicarbonate. Impaired function
of the DTDST gene product leads to undersulfation of proteoglycans and a complex
family of recessively inherited osteochondrodysplasias (achondrogenesis type IB,
atelosteogenesis type II, and diastrophic dysplasia) with clinical features including, but
not limited to, dwarfism, spinal deformation, and specific joint abnormalities.
Interestingly, while epidemiological studies have shown that the disease occurs in
most populations, it is particularly prevalent in Finland owing to an apparent founder
effect. For these reasons, siRNAs directed against this class of genes (including but
not limited to SLC26A1-9, and related molecules) may be helpful in both
therapeutic and research venues.

Ion Exchangers

Intracellular pH regulatory mechanisms are critical for the maintenance of 
countless cellular processes. For instance, in muscle cells, contractile processes and 
metabolic reactions are influenced by pH. During periods of increased energy 
demands and ischemia, muscle cells produce large amounts of lactic acid that, without 
quick and efficient disposal, would lead to acidification of the sarcoplasm. 



Several different transport mechanisms have evolved to maintain a relatively
constant intracellular pH. The relative contribution of each of these processes varies
with cell type, the metabolic requirements of the cell, and the local environmental
conditions. Intracellular pH regulatory processes that have been characterized
functionally include but are not limited to the Na⁺/H⁺ exchange, the Na(HCO₃)ₙ
cotransport, and the Na⁺-dependent and -independent Cl⁻/base exchangers. As
bicarbonate and CO₂ comprise the major pH buffer of biological fluids, sodium
bicarbonate cotransporters (NBCs) are critical. Studies have shown that these
molecules exist in numerous tissues including the kidney, brain, liver, cornea, heart,
and lung, suggesting that NBCs play an important role in mediating HCO₃⁻ transport
in both epithelial and nonepithelial cells.

Recent molecular cloning experiments have identified the existence of four
NBC isoforms (NBC1, 2, 3 and 4) and two NBC-related proteins, AE4 and NCBE
(Anion Exchanger 4 and Na-dependent Chloride-Bicarbonate Exchanger). The
secondary structure analyses and hydropathy profile of this family predict its members to be
intrinsic membrane proteins with 12 putative transmembrane domains, and several
family members exhibit N-linked glycosylation sites, protein kinases A and C, casein
kinase II, and ATP/GTP-binding consensus phosphorylation sites, as well as potential
sites for myristylation and amidation. AE4 is a relatively recent addition to this
family of proteins and shows between 30-48% homology with the other family
members. When expressed in COS-7 cells and Xenopus oocytes, AE4 exhibits
sodium-independent and DIDS-insensitive anion exchanger activity. Exchangers have been
shown to be responsible for a variety of human diseases. For instance, mutations in
three genes of the anion transporter family (SLC) are believed to cause known
hereditary diseases, including chondrodysplasia (SLC26A2, DTD), diarrhea (A3,
down-regulated in adenoma/chloride-losing diarrhea protein: DRA/CLD), and
goiter/deafness syndrome (A4, pendrin). Moreover, mutations in Na⁺/HCO₃⁻
cotransporters have also been associated with various human maladies. For these
reasons, siRNAs directed against these sorts of genes (e.g. SLC4A4-10, and related
genes) may be useful for therapeutic and research purposes.

Receptors Involved in Synaptic Transmission 

In all vertebrates, fast inhibitory synaptic transmission is the result of the
interaction between the neurotransmitters glycine (Gly) and γ-aminobutyric acid
(GABA) and their respective receptors. The strychnine-sensitive glycine receptor is
especially important in that it acts in the mammalian spinal cord and brain stem and
has a well-established role in the regulation of locomotor behavior.

Glycine receptors display significant sequence homology to several other
receptors including the nicotinic acetylcholine receptor, the γ-aminobutyric acid
receptor type A (GABA(A)R), and the serotonin receptor type 3 (5-HT₃R) subunits. As
members of the superfamily of ligand-gated ion channels, these polypeptides share
common topological features. The glycine receptor is composed of two types of
glycosylated integral membrane proteins (α1-α4 and β) arranged in a pentameric
suprastructure. The alpha subunit contains a large extracellular, N-terminal domain
that carries the structural determinants essential for agonist and antagonist binding,
followed by four transmembrane spanning regions (TM1-TM4), with TM2 playing the
critical role of forming the inner wall of the chloride channel.



The density, location, and subunit composition of glycine neurotransmitter
receptors change over the course of development. It has been observed that the
amount of GlyR gene translation (assessed by the injection of developing rat cerebral
cortex mRNA into Xenopus oocytes) decreases with age, whereas that of GABARs
increases. In addition, the type and location of mRNAs coding for GlyR change over
the course of development. For instance, in a study of the expression of alpha 1 and
alpha 2 subunits in the rat, it was observed that (in embryonic periods E11-18)
alpha 1 mRNA was scarce in the mantle zone, but the germinal zone (matrix layer) at
E11-14 expressed higher levels of the message. At postnatal day 0 (P0), the alpha 1
signals became manifest throughout the gray matter of the spinal cord. By contrast,
the spinal tissues at P0 exhibited the highest levels of alpha 2 mRNA, which
decreased with postnatal development.

In both humans and mouse mutant lines, mutations of GlyR subunit genes result
in hereditary motor disorders characterized by exaggerated startle responses and
increased muscle tone. Pathological alleles of the Glra1 gene are associated with the
murine phenotypes oscillator (spd^ot) and spasmodic (spd). Similarly, a mutant allele
of Glrb has been found to underlie the molecular pathology of the spastic mouse (spa).
Resembling the situation in the mouse, a variety of GLRA1 mutant alleles have been
shown to be associated with the human neurological disorder hyperekplexia, or startle
disease. For these reasons, siRNA directed against glycine receptors (GLRA1-3,
GLRB, and related molecules), glutamate receptors, GABA receptors, ATP receptors,
and related neurotransmitter receptor molecules may be valuable therapeutic and
research reagents.

Proteases 

Kallikreins 

One important class of proteases is the kallikreins, serine endopeptidases that
split peptide substrates preferentially on the C-terminal side of internal arginyl and
lysyl residues. Kallikreins are generally divided into two distinct groups, plasma
kallikreins and tissue kallikreins. Tissue kallikreins represent a large group of
enzymes that have substantial similarities at both the gene and protein level. The
genes encoding this group are frequently found on a single chromosome, are
organized in clusters, and are expressed in a broad range of tissues (e.g. pancreas,
ovaries, breast). In contrast, the plasma form of the enzyme is encoded by a single
gene (e.g. KLK3) that has been localized to chromosome 4q34-35 in humans. The
gene encoding plasma kallikrein is expressed solely in the liver, contains 15 exons,
and encodes a glycoprotein that is translated as a preprotein called prekallikrein.

Kallikreins are believed to play an important role in a host of physiological
events. For instance, the immediate consequence of plasma prekallikrein activation is
the cleavage of high molecular weight kininogen (HK) and the subsequent liberation
of bradykinin, a nine amino acid vasoactive peptide that is an important mediator of
inflammatory responses. Similarly, plasma kallikrein promotes single-chain urokinase
activation and subsequent plasminogen activation, events that are critical to blood
coagulation and wound healing.



Disruptions in the function of kallikreins have been implicated in a variety of
pathological processes including imbalances in renal function and inflammatory
processes. For these reasons, siRNAs directed against this class of genes (e.g.
KLK1-15) may prove valuable in both research and therapeutic settings.

ADAM Proteins

The process of fertilization takes place in a series of discrete steps whereby the
sperm interacts with i) the cumulus cells and the hyaluronic acid extracellular matrix
(ECM) in which they are embedded, ii) the egg's own ECM, called the zona pellucida
(ZP), and iii) the egg plasma membrane. During the course of these interactions, the
"acrosome reaction," the exocytosis of the acrosome vesicle on the head of the sperm,
is induced, allowing the sperm to penetrate the ZP and gain access to the perivitelline
space. This process exposes new portions of the sperm membrane, including the inner
acrosomal membrane and the equatorial segment, regions of the sperm head that can
participate in initial gamete membrane binding.

The interactions of the gamete plasma membranes appear to involve multiple
ligands and receptors and are frequently compared to leukocyte-endothelial
interactions. These interactions lead to a series of signal transduction events in the
egg, known collectively as egg activation, which include the initiation of oscillations
in intracellular calcium concentration, the exit from meiosis, the entry into the first
embryonic mitosis, and the formation of a block to polyspermy via the release of
ZP-modifying enzymes from the egg's cortical granules. Ultimately, sperm and egg not
only adhere to each other but also go on to undergo membrane fusion, making one
cell (the zygote) from two.

Studies on the process of sperm-egg interactions have identified a number of 
proteins that are crucial for fertilization. One class of proteins, called the ADAM 
family (A Disintegrin And Metalloprotease), has been found to be important in 
spermatogenesis and fertilization, as well as various developmental systems including 
myogenesis and neurogenesis. Members of the family contain a disintegrin and
metalloprotease domain (and therefore potentially have both cell adhesion and
protease activities), as well as cysteine-rich regions, epidermal growth factor
(EGF)-like domains, a transmembrane region, and a cytoplasmic tail. Currently, the ADAM
gene family has 29 members, and constituents are widely distributed in many tissues
including the brain, testis, epididymis, ovary, breast, placenta, liver, heart, lung, bone, 
and muscle. 

One of the best-studied members of the ADAM family is fertilin, a
heterodimeric protein comprised of at least two subunits, fertilin alpha and fertilin
beta. The fertilin beta gene (ADAM2) has been disrupted with a targeting gene
construct corresponding to the exon encoding the fertilin beta disintegrin domain.
Sperm from males homozygous for disruptions in this region exhibit defects in
multiple facets of sperm function including reduced levels of sperm transit from the
uterus to the oviduct, reduced sperm-ZP binding, and reduced sperm-egg binding, all
of which contribute to male infertility.

Recently, four new ADAM family members (ADAM 24-27) have been
isolated. The deduced amino acid sequences show that all four contain the complete
domain organization common to ADAM family members, and Northern blot analysis
has shown all four to be specific to the testes. siRNAs directed against this class of
genes (e.g. ADAM2 and related proteins) may be useful as research tools and
therapeutics directed toward fertility and birth control.

Aminopeptidases

Aminopeptidases are proteases that play critical roles in processes such as
protein maturation, protein digestion in its terminal stage, regulation of hormone
levels, selective or homeostatic protein turnover, and plasmid stabilization. These
enzymes generally have broad substrate specificity, occur in several forms, and play a
major role in physiological homeostasis. For instance, the effects of bradykinin,
angiotensin converting enzyme (ACE), and other vasoactive molecules are muted by
one of several peptidases that cleave the molecule at an internal position and eliminate
its ability to bind its cognate receptor (e.g. for bradykinin, the B2-receptor).

Among the enzymes that can cleave bradykinin is the membrane bound
aminopeptidase P, also referred to as aminoacylproline aminopeptidase, proline
aminopeptidase, X-Pro aminopeptidase (eukaryote), and XPNPEP2. Aminopeptidase
P is an aminoacylproline aminopeptidase specific for NH₂-terminal Xaa-proline
bonds. The enzyme i) is a mono-zinc-containing molecule that lacks any of the typical
metal binding motifs found in other zinc metalloproteases, ii) has an active-site
configuration similar to that of other members of the MG peptidase family, and iii) is
present in a variety of tissues including but not limited to the lung, kidney, brain, and
intestine.

Aminopeptidases play an important role in a diverse set of human diseases.
Low plasma concentrations of aminopeptidase P are a potential predisposing factor
for development of angio-oedema in patients treated with ACE inhibitors, and
inhibitors of aminopeptidase P may act as cardioprotectors against other forms of
illness including, but not limited to, myocardial infarction. For these reasons, siRNAs
directed against this family of proteins (including but not limited to XPNPEP1 and
related proteins) may be useful as research and therapeutic tools.



Serine Proteases 

One important class of proteases is the serine proteases. Serine proteases
share a common catalytic triad of three amino acids in their active site (serine
(nucleophile), aspartate (electrophile), and histidine (base)) and can hydrolyze either
esters or peptide bonds utilizing mechanisms of covalent catalysis and preferential
binding of the transition state. Based on the position of their introns, serine proteases
have been classified into a minimum of four groups including those in which 1) the
gene has no introns interrupting the exon coding for the catalytic triad (e.g. the
haptoglobin gene); 2) each gene contains an intron just downstream from the codon
for the histidine residue at the active site, a second intron downstream from the exon
containing the aspartic acid residue of the active site, and a third intron just upstream
from the exon containing the serine of the active site (e.g. trypsinogen,
chymotrypsinogen, kallikrein and proelastase); 3) the genes contain seven introns
interrupting the exons coding the catalytic region (e.g. complement factor B gene);
and 4) the genes contain two introns resulting in a large exon that contains both the
active site aspartic acid and serine residues (e.g. factor X, factor IX and protein C
genes).

Cytotoxic lymphocytes (e.g. CD8(+) cytotoxic T cells and natural killer cells)
form the major defense of higher organisms against virus-infected and transformed
cells. A key function of these cells is to detect and eliminate potentially harmful cells
by inducing them to undergo apoptosis. This is achieved through two principal
pathways, both of which require direct but transient contact between the killer cell and
its target. The first pathway involves ligation of TNF receptor-like molecules such as
Fas/CD95 to their cognate ligands, and results in mobilization of conventional,
programmed cell-death pathways centered on activation of pro-apoptotic caspases.
The second mechanism consists of a pathway whereby the toxic contents of a
specialized class of secretory vesicles are introduced into the target cell. Studies over
the last two decades have identified the toxic components as granzymes, a family of
serine proteases that are expressed exclusively by cytotoxic T lymphocytes and
natural killer (NK) cells. These agents are stored in specialized lytic granules and
enter the target cell via endocytosis. Like caspases, cysteine proteases that play an
important role in apoptosis, granzymes can cleave proteins after acidic residues,
especially aspartic acid, and induce apoptosis in the recipient cell.

Granzymes have been grouped into three subfamilies according to substrate
specificity. Members of the granzyme family that have enzymatic activity similar to
the serine protease chymotrypsin are encoded by a gene cluster termed the 'chymase
locus'. Similarly, granzymes with trypsin-like specificities are encoded by the
'tryptase locus', and a third subfamily cleaves after unbranched hydrophobic residues,
especially methionine, and is encoded by the 'Met-ase locus'. All granzymes are
synthesized as zymogens and, after clipping of the leader peptide, obtain maximal
enzymatic activity subsequent to the removal of an amino-terminal dipeptide.

Granzymes have been found to play a role in a number of important
biological functions including defense against intracellular pathogens, graft versus
host reactions, the susceptibility to transplantable and spontaneous malignancies,
lymphoid homeostasis, and the tendency toward auto-immune diseases. For these
reasons, siRNAs directed against granzymes (e.g. GZMA, GZMB, GZMH, GZMK,
GZMM) and related serine proteases may be useful research and therapeutic reagents.

Kinases 

Protein Kinases (PKs) have been implicated in a number of biological
processes. Kinase molecules play a central role in modulating cellular physiology and
developmental decisions, and have been implicated in a large list of human maladies
including cancer, diabetes, and others.



During the course of the last three decades, over a hundred distinct protein
kinases have been identified, all with presumed specific cellular functions. A few of
these enzymes have been isolated to sufficient purity to perform in vitro studies, but
most remain intractable due to the low abundance of these molecules in the cell. To
counter this technical difficulty, a number of protein kinases have been isolated by
molecular cloning strategies that utilize the conserved sequences of the catalytic
domain to isolate closely related homologs. Alternatively, some kinases have been
purified (and subsequently studied) based on their interactions with other molecules.

p58 is a member of the p34cdc2-related supergene family and contains a large
domain that is highly homologous to the cell division control kinase, cdc2. This new
cell division control-related protein kinase was originally identified as a component of
semipurified galactosyltransferase; thus, it has been denoted
galactosyltransferase-associated protein kinase (GTA-kinase). GTA-kinase has been found to be expressed
in both adult and embryonic tissues and is known to phosphorylate a number of
substrates, including histone H1 and casein. Interestingly, overexpression of
this molecule in CHO cells has shown that elevated levels of p58 result in a prolonged
late telophase and an early G1 phase, thus hinting at an important role for
GTA-kinase in cell cycle regulation.

Cyclin Dependent Kinases

The cyclin-dependent kinases (Cdks) are a family of highly conserved 
serine/threonine kinases that mediate many of the cell cycle transitions that occur 
during duplication. Each of these Cdk catalytic subunits associates with a specific 
subset of regulatory subunits, termed cyclins, to produce a distinct Cdk-cyclin kinase
complex that, in general, functions to execute a unique cell cycle event. 

Activation of the Cdk-cyclin kinases during cellular transitions is controlled by
a variety of regulatory mechanisms. For the Cdc2-cyclin B complex, inhibition of
kinase activity during S phase and G2 is accomplished by phosphorylation of two
Cdc2 residues, Thr¹⁴ and Tyr¹⁵, which are positioned within the ATP-binding cleft.
Phosphorylation of Thr¹⁴ and/or Tyr¹⁵ suppresses the catalytic activity of the molecule
by disrupting the orientation of the ATP present within this cleft. In contrast, the
abrupt dephosphorylation of these residues by the Cdc25 phosphatase results in the
rapid activation of Cdc2-cyclin B kinase activity and subsequent downstream mitotic
events. While the exact details of this pathway have yet to be elucidated, it has been
proposed that Thr¹⁴/Tyr¹⁵ phosphorylation functions to permit a cell to attain a critical
concentration of inactive Cdk-cyclin complexes, which, upon activation, induces a
rapid and complete cell cycle transition. Furthermore, there is evidence in
mammalian cells that Thr¹⁴/Tyr¹⁵ phosphorylation also functions to delay Cdk
activation after DNA damage.

The Schizosaccharomyces pombe wee1 gene product was the first kinase
identified that is capable of phosphorylating Tyr¹⁵ in Cdc2. Homologs of the Wee1
kinase have been subsequently identified and biochemically characterized from a wide
range of species including human, mouse, frog, Saccharomyces cerevisiae, and
Drosophila. In vertebrate systems, where Thr¹⁴ in Cdc2 is also phosphorylated, the
Wee1 kinase was capable of phosphorylating Cdc2 on Tyr¹⁵, but not Thr¹⁴, indicating
that another kinase was responsible for Thr¹⁴ phosphorylation. This kinase, Myt1,
was recently isolated from the membrane fractions of Xenopus egg extracts
and has been shown to be capable of phosphorylating Thr¹⁴ and, to a lesser extent,
Tyr¹⁵ in Cdc2. A human Myt1 homolog displaying similar properties has been
isolated, as well as a non-membrane-associated molecule with Thr¹⁴ kinase activity.

In the past decade it has been shown that cancer can originate from
overexpression of positive regulators, such as cyclins, or from underexpression of
negative regulators (e.g. p16 (INK4a), p15 (INK4b), p21 (Cip1)). Inhibitors such as
Myt1 are the focus of much cancer research because they are capable of controlling
cell cycle proliferation, now considered the Holy Grail for cancer treatment. For these
reasons, siRNA directed against kinases and kinase inhibitors including but not
limited to ABL1, ABL2, ACK1, ALK, AXL, BLK, BMX, BTK, C20orf64, CSF1R,
CSK, DDR1, DDR2, DKFZp761P1010, EGFR, EPHA1, EPHA2, EPHA3, EPHA4,
EPHA7, EPHA8, EPHB1, EPHB2, EPHB3, EPHB4, EPHB6, ERBB2, ERBB3,
ERBB4, FER, FES, FGFR1, FGFR2, FGFR3, FGFR4, FGR, FLT1, FLT3, FLT4,
FRK, FYN, HCK, IGF1R, INSR, ITK, JAK1, JAK2, JAK3, KDR, KIAA1079, KIT,
LCK, LTK, LYN, MATK, MERTK, MET, MST1R, MUSK, NTRK1, NTRK2,
NTRK3, PDGFRA, PDGFRB, PTK2, PTK2B, PTK6, PTK7, PTK9, PTK9L, RET,
ROR1, ROR2, ROS1, RYK, SRC, SYK, TEC, TEK, TIE, TNK1, TXK, TYK2,
TYRO3, YES1, and related proteins, may be useful for research and therapeutic
purposes.

G Protein Coupled Receptors 

One important class of genes to which siRNAs can be directed is the G-protein
coupled receptors (GPCRs). GPCRs constitute a superfamily of seven transmembrane
spanning proteins that respond to a diverse array of sensory and chemical stimuli,
such as light, odor, taste, pheromones, hormones and neurotransmitters. GPCRs play a
central role in cell proliferation and differentiation, and have been implicated in the
etiology of disease.

The mechanism by which G protein-coupled receptors translate extracellular 
signals into cellular changes was initially envisioned as a simple linear model: 
activation of the receptor by agonist binding leads to dissociation of the heterotrimeric 
GTP-binding G protein (Gs, Gi, or Gq) into its alpha and beta/gamma subunits, both 
of which can activate or inhibit various downstream effector molecules. More 
specifically, activation of the GPCR induces a conformational change in the Gα 
subunit, causing GDP to be released and GTP to be bound in its place. The Gα and 
Gβγ subunits then dissociate from the receptor and interact with a variety of effector 
molecules. For instance, in the case of the Gs family, the primary function is to 
stimulate the intracellular enzyme adenylate cyclase (AC), which catalyzes the 
conversion of cytoplasmic ATP into the second messenger cyclic AMP (cAMP). 
In contrast, the Gi family inhibits this pathway, and the Gq family activates 
phospholipase C (PLC), which cleaves phosphatidylinositol 4,5-bisphosphate (PIP2) 
to generate inositol 1,4,5-trisphosphate (IP3) and diacylglycerol (DAG). 
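
The linear model described above can be summarized as a small lookup table. The following Python sketch is illustrative only: the dictionary layout and the function name `downstream_effect` are assumptions made for this example and are not part of the disclosure; only the family-to-effector relationships are taken from the text.

```python
# Illustrative summary of the linear G-protein signaling model described above.
# The data structure and names are hypothetical; only the biology comes from the text.
G_PROTEIN_FAMILIES = {
    "Gs": {"effector": "adenylate cyclase", "action": "stimulates",
           "second_messengers": ["cAMP"]},
    "Gi": {"effector": "adenylate cyclase", "action": "inhibits",
           "second_messengers": ["cAMP"]},
    "Gq": {"effector": "phospholipase C", "action": "activates",
           "second_messengers": ["IP3", "DAG"]},
}


def downstream_effect(family: str) -> str:
    """Describe the primary effector and second messengers for a G-protein family."""
    entry = G_PROTEIN_FAMILIES[family]
    messengers = " and ".join(entry["second_messengers"])
    return f"{family} {entry['action']} {entry['effector']} ({messengers})"


if __name__ == "__main__":
    for family in G_PROTEIN_FAMILIES:
        print(downstream_effect(family))
```

A table of this kind only captures the initial linear view; the cross-talk with receptor tyrosine kinases is not modeled.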

More recently, studies have shown that the functions of GPCRs are not limited 
to their actions on G-proteins and that considerable cross-talk exists between this 
diverse group of receptor molecules and a second class of membrane bound proteins, 
the receptor tyrosine kinases (RTKs). A number of GPCRs, such as the endothelin-1, 
thrombin, bombesin, and dopamine receptors, can activate MAPKs, downstream 
effectors of the RTK/Ras pathway. Interestingly, the interaction between these two 
families is not unidirectional, and RTKs can also modulate the activity of signaling 
pathways traditionally thought to be controlled exclusively by ligands that couple to 
GPCRs. For instance, EGF, which normally activates the MAPK cascade via the 
EGF receptor, can stimulate adenylate cyclase activity by activating Gαs. 

There are dozens of members of the G Protein-Coupled Receptor family that 
have emerged as prominent drug targets in the last decade. One non-limiting list of 
potential GPCR-siRNA targets is as follows: 

CMKLR1 

CML1/CMKLR1 (Accession No. Q99788) is a member of the chemokine 
receptor family of GPCRs that may play a role in a number of diseases, including 
those involved in inflammation and immunological responses (e.g. asthma, arthritis). 
For this reason, siRNA directed against this protein may prove to be important 
therapeutic reagents. 


Studies of juvenile-onset neuronal ceroid lipofuscinosis (JNCL, Batten 
disease), the most common form of childhood encephalopathy, which is characterized by 
progressive neural degeneration, show that it is brought on by mutations in a novel 
lysosomal membrane protein (CLN3). In addition to being implicated in JNCL, 
CLN3 (GPCR-like protein, Accession No. A57219) expression studies have shown 
that the CLN3 mRNA and protein are highly over-expressed in a number of cancers 
(e.g. glioblastomas, neuroblastomas, as well as cancers of the prostate, ovaries, breast, 
and colon), suggesting a possible contribution of this gene to tumor growth. For this 
reason, siRNA directed against this protein may prove to be important therapeutic 
reagents. 



CALCR 

The calcitonin receptor (CTR/CALCR, Accession No. NM_001742) belongs 
to "family B" of GPCRs, which typically recognize regulatory peptides such as 
parathyroid hormone, secretin, glucagon and vasoactive intestinal polypeptide. 
Although the CT receptor typically binds to calcitonin (CT), a 32 amino acid peptide 
hormone produced primarily by the thyroid, association of the receptor with RAMP 
(Receptor Activity Modulating Protein) enables it to readily bind other members of 
the calcitonin peptide family, including amylin (AMY) and other CT gene-related 
peptides (e.g. αCGRP and βCGRP). While the primary function of the calcitonin 
receptor pertains to regulating osteoclast-mediated bone resorption and enhanced Ca2+ 
excretion by the kidney, recent studies have shown that CT and CTRs may play an 
important role in a variety of processes as wide ranging as embryonic/foetal 
development and sperm function/physiology. In addition, studies have shown that 
patients with particular CTR genotypes may be at higher risk of losing bone mass and 
that this GPCR may contribute to the formation of calcium oxalate urinary stones. For 
this reason, siRNA directed against CTR may be useful as therapeutic reagents. 

OXTR 

The human oxytocin receptor (OTR, OXTR) is a 389 amino acid polypeptide 
that exhibits the seven transmembrane domain structure and belongs to the Class I 
(rhodopsin-type) family of G-protein coupled receptors. OTR is expressed in a wide 
variety of tissues throughout development and mediates physiological changes 
through G(q) proteins and phospholipase C-beta. Studies on the functions of oxytocin 
and the oxytocin receptor have revealed a broad range of roles. OT and OTR play a 
role in a host of sexual, maternal and social behaviors that include egg-laying, birth, 
milk-letdown, feeding, grooming, memory and learning. In addition, it has been 
hypothesized that abnormalities in the functionality of the oxytocin-OTR receptor-ligand 
system can lead to a host of irregularities including compulsive behavior, eating 
disorders (such as anorexia), depression, and various forms of neurodegenerative 
diseases. For these reasons, siRNA directed against this gene (NM_000916) may play 
an important role in combating OTR-associated illnesses. 



EDG GPCRs 

Lysophosphatidic acid and other lipid-based hormones/growth factors induce 
their effects by activating signaling pathways through the G-protein coupled receptors 
(GPCRs) and have been observed to play important roles in a number of human 
diseases including cancer, asthma, and vascular pathologies. For instance, during 
studies of immunoglobulin A nephropathy (IgAN), researchers have observed an 
enhanced expression of EDG5 (NP_004221), suggesting a contribution of this gene 
product to the development of IgAN. For that reason, siRNA directed against Edg5 
(NM_004230), Edg4 (NM_004720), Edg7 (NM_012152) and related genes may play 
an important role in combating human disease. 

Genes Involved in Cholesterol Signaling and Biosynthesis 

Studies on model genetic organisms such as Drosophila and C. elegans have 
led to the identification of a plethora of genes that are essential for early development. 
Mutational analysis and ectopic expression studies have allowed many of these genes 
to be grouped into discrete signal transduction pathways and have shown that these 
elements play critical roles in pattern formation and cell differentiation. Disruption of 
one or more of these genes during early stages of development frequently leads to 
birth defects, whereas alteration of gene function at later stages in life can result in 
tumorigenesis. 

One critical set of interactions known to exist in both invertebrates and 
vertebrates is the Sonic Hedgehog-Patched-Gli pathway. The pathway was originally 
documented through a Drosophila segmentation mutant, and several labs have recently 
identified human and mouse orthologs of many of the pathway's members and have 
successfully related disruptions in these genes to known diseases. Pathway activation 
is initiated with the secretion of Sonic hedgehog. There are three closely related 
members of the hedgehog family (Sonic, Desert, and Indian hedgehog), with Shh being 
the most widely expressed form of the group. The Shh gene product is secreted as a 
small pro-signal molecule. To successfully initiate its developmental role, Shh is first 
cleaved, whereupon the N-terminal truncated fragment is covalently modified with cholesterol. 
The addition of the sterol moiety promotes the interaction between Shh and its 
cognate membrane bound receptor, Patched (Ptch). There are at least two isoforms of 
the Patched gene, Ptch1 and Ptch2. Both isoforms contain a sterol-sensing domain 
(SSD), a roughly 180 amino acid cluster that is found in at least seven different 
classes of molecules including those involved in cholesterol biosynthesis, vesicular 
traffic, signal transduction, cholesterol transport, and sterol homeostasis. In the 
absence of Shh, the Patched protein is a negative regulator of the pathway. In contrast, 
binding of Shh-cholesterol to the Patched receptor releases the inhibition 
that Patched enforces on a G-protein coupled receptor known as Smoothened. 
Subsequent activation of Smoothened (directly or indirectly) leads to the triggering of 
a trio of transcription factors that belong to the Gli family. All three factors are 
relatively large, contain a characteristic C2-H2 zinc-finger pentamer, and recognize 
one of two consensus sequences (SEQ ID NO. 0463 GACCACCCA or SEQ ID NO. 
0464 GAACCACCCA). In the absence of Shh, Gli proteins are cleaved by the 
proteasome and the C-terminally truncated fragment translocates to the nucleus and 
acts as a dominant transcription repressor. In the presence of Shh-cholesterol, Gli 
repressor formation is inhibited and full-length Gli functions as a transcriptional 
activator. 
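
As an illustration of how the two Gli consensus sequences quoted above might be located in a stretch of DNA, the following Python sketch performs a simple exact-match scan. The function name `find_gli_sites` and the example sequence are hypothetical and introduced only for this example; the scan covers the given strand only and does not model binding affinity or degenerate sites.

```python
# Scan a DNA sequence for the two Gli consensus binding sites quoted above.
# Exact matching on one strand only; names and test data are illustrative.
GLI_CONSENSUS = ("GACCACCCA", "GAACCACCCA")


def find_gli_sites(dna: str) -> list[tuple[int, str]]:
    """Return (0-based position, motif) for every exact consensus match."""
    dna = dna.upper()
    hits = []
    for motif in GLI_CONSENSUS:
        start = dna.find(motif)
        while start != -1:
            hits.append((start, motif))
            start = dna.find(motif, start + 1)  # allow overlapping matches
    return sorted(hits)


if __name__ == "__main__":
    example = "TTGACCACCCAGGGAACCACCCATT"  # made-up sequence for demonstration
    for position, motif in find_gli_sites(example):
        print(position, motif)
```

To cover both strands, the same function could be applied to the reverse complement of the input.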

Shh and other members of the Shh-PTCH-Gli pathway are expressed in a 
broad range of tissues (e.g. the notochord, the floorplate of the neural tube, the brain 
and the gut) at early stages in development. Not surprisingly, mutations that lead to 
altered protein expression or function have been shown to induce developmental 
abnormalities. Defects in the human Shh gene have been shown to cause 
holoprosencephaly, a midline defect that manifests itself as cleft lip or palate, CNS 
septation, and a wide range of other phenotypes. Interestingly, defects in cholesterol 
biosynthesis generate similar Shh-like disorders (e.g. Smith-Lemli-Opitz syndrome), 
suggesting that cholesterol modification of the Shh gene product is crucial for 
pathway function. Both the Patched and Smoothened genes have also been shown to 
be clinically relevant, with Smoothened now being recognized as an oncogene that, 
like PTCH-1 and PTCH-2, is believed to be the causative agent of several forms of 
adult tumors. For these reasons, siRNA directed against Smoothened (SMO, 
NM_005631), Patched (PTCH, NM_000264), and additional genes that participate in 
cholesterol signaling, biosynthesis, and degradation have potentially useful research 
and therapeutic applications. 

Targeted Pathways 

In addition to targeting siRNA against one or more members of a family of 
proteins, siRNA can be directed against members of a pathway. Thus, for instance, 
siRNA can be directed against members of a signal transduction pathway (e.g. the 
insulin pathway, including AKT1-3, CBL, CBLB, EIF4EBP1, FOXO1A, FOXO3A, 
FRAP1, GSK3A, GSK3B, IGF1, IGF1R, INPP5D, INSR, IRS1, MLLT7, PDPK1, 
PIK3CA, PIK3CB, PIK3R1, PIK3R2, PPP2R2B, PTEN, RPS6, RPS6KA1, 
RPS6KA3, SGK, TSC1, TSC2, and XPO1), an apoptotic pathway (CASP3, 6, 7, 8, 9, 
DSH1/2, P110, P85, PDK1/2, CATENIN, HSP90, CDC37, P23, BAD, BCLXL, 
BCL2, SMAC, and others), and pathways involved in DNA damage, cell cycle, and other 
physiological processes (p53, MDM2, CHK1/2, BRCA1/2, ATM, ATR, P15INK4, P27, P21, 
SKP2, CDC25C/A, 14-3-3, PLK, RB, CDK4, GLUT4, iNOS, mTOR, FKBP, PPAR, 
RXR, ER). Similarly, genes involved in immune system function including TNFR1, 
IL-1R, IRAK1/2, TRAF2, TRAF6, TRADD, FADD, IKKε, IKKγ, IKKβ, IKKα, 
IκBα, IκBβ, p50, p65, Rac, RhoA, Cdc42, ROCK, Pak1/2/3/4/5/6, cIAP, HDAC1/2, 
CBP, β-TrCP, Rip2/4, and others are also important targets for the siRNAs described 
in this document and may be useful in treating immune system disorders. Genes 
involved in apoptosis, such as Dsh1/2, PTEN, P110 (pan), P85, PDK1/2, Akt1, Akt2, 
Akt (pan), p70S6K, GSK3β, PP2A (cat), β-catenin, HSP90, Cdc37/p50, P23, Bad, 
BclxL, Bcl2, Smac/Diablo, and Ask1 are potentially useful targets in the treatment of 
diseases that involve defects in programmed cell death (e.g. cancer), while siRNA 
agents directed against p53, MDM2, Chk1/2, BRCA1/2, ATM, ATR, p15INK4, P27, 
P21, Skp2, Cdc25C/A, 14-3-3σ/ε, PLK, Rb, Cdk4, Glut4, iNOS, mTOR, FKBP, 
PPARγ, RXRα, ERα and related genes may play a critical role in combating diseases 
associated with disruptions in DNA repair and cell cycle abnormalities. 

Tables VI through X below provide examples of useful pools for inhibiting 
different genes in the human insulin pathway and tyrosine kinase pathways, proteins 
involved in the cell cycle, the production of nuclear receptors, and other genes. These 
pools are particularly useful in humans, but would be useful in any species 
that generates an appropriately homologous mRNA. Further, within each of the listed 
pools any one sequence may be used independently, but preferably at least two of the 
listed sequences, more preferably at least three, and most preferably all of the listed 
sequences for a given gene are present. 
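
The pooling preference described above can be sketched in a few lines of Python. This is a minimal illustration, not the method of the disclosure: the function names (`build_pools`, `select_pool`, `antisense_rna`) are hypothetical, the four rows are the TSC2 entries listed in this section, and the sketch assumes the listed 19-mers are sense-strand target sequences written in the DNA alphabet.

```python
from collections import defaultdict

# (gene, duplex number, sense sequence) rows; TSC2 entries from the listing in this section.
ROWS = [
    ("TSC2", "D-003029-05", "GCATTAATCTCTTACCATA"),
    ("TSC2", "D-003029-06", "CCAATGTCCTCTTGTCTTT"),
    ("TSC2", "D-003029-07", "GGAGACACATCACCTACTT"),
    ("TSC2", "D-003029-08", "TCACCAGGCTCATCAAGAA"),
]

# DNA sense base -> complementary RNA base (assumption: input uses A/C/G/T only).
COMPLEMENT_RNA = {"A": "U", "T": "A", "G": "C", "C": "G"}


def antisense_rna(sense_dna: str) -> str:
    """Return the reverse complement of a sense DNA sequence, written as RNA."""
    return "".join(COMPLEMENT_RNA[base] for base in reversed(sense_dna.upper()))


def build_pools(rows):
    """Group (duplex, sequence) pairs by gene name."""
    pools = defaultdict(list)
    for gene, duplex, sequence in rows:
        pools[gene].append((duplex, sequence))
    return dict(pools)


def select_pool(pools, gene, min_size=2):
    """Return all duplexes for a gene, requiring at least min_size of them."""
    duplexes = pools.get(gene, [])
    if len(duplexes) < min_size:
        raise ValueError(f"{gene}: {len(duplexes)} duplexes, need at least {min_size}")
    return duplexes


if __name__ == "__main__":
    pools = build_pools(ROWS)
    for duplex, sequence in select_pool(pools, "TSC2", min_size=2):
        print(duplex, sequence, antisense_rna(sequence))
```

The default `min_size=2` mirrors the "preferably at least two" wording; passing 3 or 4 corresponds to the more preferred pool sizes.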
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GSK3A 



GSK3A    NM_019884  11995473  2931  D-003009-05  GGACAAAGGTGTTCAAATC  501
GSK3A    NM_019884  11995473  2931  D-003009-06  GAACCCAGCTGCCTAACAA  502
GSK3A    NM_019884  11995473  2931  D-003009-07  GCGCACAGCTTCTTTGATG  503
GSK3A    NM_019884  11995473  2931  D-003009-08  nHTCTAGCCTGCTGGAGTA  504



GSK3B 



GSK3B 



NM 002093 



21361339 



GSK3B 



NM 002093 



21361339 



GSK3B 



NM 002093 



21361339 



GSK3B 



NM 002093 



2932 



2932 



D-003010-05 



nAAGAAAGATGAGGTCTAT 



505 



D-00301 0-06 



2932 



21361339 



2932 



D-00301 0-07 GAAATGAACCCAAACTACA 



D-0030 10-08 



GGACCCAAATGTCAAACTA 



506 



507 



nATGAGGTCTATCTTAATC 



508 



IGF1 



NM 000618 



D-0030 11-05 



nGAAGTACATTTGAAGAAC 



I1GF1 
IIGF1 



NM 000618 



D-00301 1-06 



AGAAGGAAGTACATTTGAA 
HHTCAAGCCTGCCAAGTCA 



509 



510 
511 



NM 000618 



D-00301 1-07 



IGF1 



NM 00061 8 



D-0030 11-08 



(^GTGGATGCTCTTCAGTTC 



512 



1GF1R 



NM 000875 



11068002 



IIGF1R 



NM 000875 



11068002 



IGF1R 



NM 000875 



11068002 



IGF1R 



1NPP5D 
1NPP5D 
INPP5D 



NM 000875 



11068002 



NM 005541 



5031798 



3480 



D-0030 12-05 



3480 



D-0030 12-06 



3480 



D-0030 12-07 



3480 



3635 



D-003012-08 



D-003013-05 



CAACGAAGCTTCTGTGATG 
GGCCAGAAATGGAGAATAA 



513 



514 



fiAAGCACCCTTTAAGAATG 



515 



GCAGACACCTACAACATCA 



516 



GGAATTGCGTTTACACTTA 



517 
518 



NM 005541 5031 798 



3635 



D-0030 13-06 



^GAAACTGATCATTAAGAA 



INPP5D 



NM 005541 



5031798 



INPP5D 



INSR 



INSR 



INSR 



INSR 



INSR 



IRS1 



IRS1 
IRS1 



NM 005541 



5031798 



NM 000208 



NM 000208 



NM 000208 



NM 000208 



NM 005544 



3635 



D-0Q3013-Q7 CGACAGGGATGAAGTACAA 



3635 



n-003013-08 AAACGCAGCTGCCCATCTA 



519 
520 



4557883 



3643 



D-0030 14-05 



4557883 



3643 



D-0030 14-06 



4557883 



3643 



D~003Q14-07 



4557883 



3643 



D-0030 14-08 



5031804 



3667 



D-0030 15-05 



QGAAGACGTTTGAGGATTA 



GAACAAGGCTCCCGAGAGT 
GGAGAGACCTTGGAAATTG_ 



521 



522 



nGACGGAACCCACCTATTT 



AAAGAGGTCTGGCAAGTGA 



523 



524 



525 
526 



NM 005544 



5031804 



3667, 



D-003015-06 



nAACCTGATTGGTATCTAC 



I IRS1 



NM 005544 



5031804 



,IRS1 



NM 005544 



5031804 



3667 
3667 



D-0030 15-07 



nHAOGGCGATCTAGTGCTT 



527 



D-003015-08 



nTOAGTCTGTCGTCCAGTA 



528 



MLLT7 



MLLT7 



IMLLT7 



MLLT7 



MLLT7 



PPPK1 
PDPK1 



NM 005938 



5174578 



4303 



NM 005938 



5174578 



4303 



NM 005938 



5174578 



NM 005938 5174578 



D-00301 6-05 GGACTGGACTTCAACTTTG 
n-nfv*ni fi-nfi I r,C ACGA AGCAGTTCAAAXG 



529 



4303 



4303 



D-00301 6-07 



D-00301 6-08 



530 



GAGAAGCGACTGACACTTG 
GACCAGAGATCGCTAACCA 



531 



532 



NM 002613 



4505694 



5170 



n-003017-05" CAAGAGACCTCGTGGAGAA 



533 
534 



PDPK1 
PDPK1 



NM 002613 4505694 



5170 D-00301 7-06 



nACCAGAGGCCAAGAATTT 



NM 002613 



4505694 



PDPK1 



NM 002613 



4505694 



PIK3CA 



PIK3CA 



NM 006218 



P1K3CA 



NM 006218 



P1K3CA 



NM 006218 



PIK3CA 



NM 006218 



PIK3CB 



PIK3CB 



NM 006219 



5170 D-00301 7-07 



5170 



D-0030 17-08 



5453891 



5290 



5453891 



5290 



5453891 



5290 



5453891 



5290 



5453893 



5291 



GGAAACGAGTATCTTATAT 
QAGAAGCGACATATCATAA 



535 
536 



D-00301 8-05 



D-00301 8-06 



G CT ATC ATCTG A AC AATT A 537 
G G AT AG AG G CC AAAT AAT A I 538 



D-00301 8-07 



D-00301 8-08 



GGACAACTGTTTCATATAG I 53S 
^CCAGTACCTCATGGATTA I 54C 



D-00301 9-05 



CGACAAGACTGCCGAGAGA_ 



541 
54S 



PIK3CB 



PIK3CB 



PIK3CB 



NM 006219 



5453893 



NM 006219 



5453893 



NM 006219 



5453893 



5291 



D-00301 9-06 



5291 



D-00301 9-07 



TCAAGTGTCTCCTAATATG 
nGATTCAGTTGGAGTGATT 



5291 



D-00301 9-08 TTTCAAGTGTCTCCTAATA 



54> 
54< 



PIK3R1   NM_181504  32455251  5295  D-003020-05  GGAAATATGGCTTCTCTGA  545
PIK3R1   NM_181504  32455251  5295  D-003020-06  GAAAGACGAGAGACCAATA  546
PIK3R1   NM_181504  32455251  5295  D-003020-07  GTAAAGCATTGTGTCATAA  547
PIK3R1   NM_181504  32455251  5295  D-003020-08  GGATCAAGTTGTCAAAGAA  548
PIK3R2   NM_005027  4826907   5296  D-003021-05  GGAAAGGCGGGAACAATAA  549
PIK3R2   NM_005027  4826907   5296  D-003021-06  GATGAAGCGTACTGCAATT  550
PIK3R2   NM_005027  4826907   5296  D-003021-07  GGACAGCGAATCTCACTAC  551
PIK3R2   NM_005027  4826907   5296  D-003021-08  GCAAGATCCGAGACCAGTA  552
PPP2R2B  NM_004576  4758953   5521  D-003022-05  GAATGCAGCTTACTTTCTT  553
PPP2R2B  NM_004576  4758953   5521  D-003022-06  GACCGAAGCTGACATTATC  554
PPP2R2B  NM_004576  4758953   5521  D-003022-07  TCGATTACCTGAAGAGTTT  555
PPP2R2B  NM_004576  4758953   5521  D-003022-08  CCTGAAGAGTTTAGAAATA  556
PTEN     NM_000314  4506248   5728  D-003023-05  GTGAAGATCTTGACCAATG  557
PTEN     NM_000314  4506248   5728  D-003023-06  GATCAGCATACACAAATTA  558
PTEN     NM_000314  4506248   5728  D-003023-07  GGCGCTATGTGTATTATTA  559
PTEN     NM_000314  4506248   5728  D-003023-08  GTATAGAGCGTGCAGATAA  560
RPS6     NM_001010  17158043  6194  D-003024-05  GCCAGAAACTCATTGAAGT  561
RPS6     NM_001010  17158043  6194  D-003024-06  GGATATTCCTGGACTGACT  562
RPS6     NM_001010  17158043  6194  D-003024-07  CCAAGGAGAACTGGAGAAA  563
RPS6     NM_001010  17158043  6194  D-003024-08  GCGTATGGCCACAGAAGTT  564
RPS6KA1  NM_002953  20149546  6195  D-003025-05  GATGACACCTTCTACTTTG  565
RPS6KA1  NM_002953  20149546  6195  D-003025-06  GAGAATGGGCTCCTCATGA  566
RPS6KA1  NM_002953  20149546  6195  D-003025-07  CAAGCGGGATCCTTCAGAA  567
RPS6KA1  NM_002953  20149546  6195  D-003025-08  CCACCGGCCTGATGGAAGA  568
RPS6KA3  NM_004586  4759049   6197  D-003026-05  GAAGGGAAGTTGTATCTTA  569
RPS6KA3  NM_004586  4759049   6197  D-003026-06  GAAAGTATGTGTATGTAGT  570
RPS6KA3  NM_004586  4759049   6197  D-003026-07  GGACAGCATCCAAACATTA  571
RPS6KA3  NM_004586  4759049   6197  D-003026-08  GGAGGTGAATTGCTGGATA  572
SGK      NM_005627  5032090   6446  D-003027-01  TTAATGGTGGAGAGTTGTT  573
SGK      NM_005627  5032090   6446  D-003027-04  ATTAACTGGGATGATCTCA  574
SGK      NM_005627  25168262  6446  D-003027-05  GAAGAAAGCAATCCTGAAA  575
SGK      NM_005627  25168262  6446  D-003027-06  AAACACAGCTGAAATGTAC  576
TSC1     NM_000368  24475626  7248  D-003028-05  GAAGATGGCTATTCTGTGT  577
TSC1     NM_000368  24475626  7248  D-003028-06  TATGAAGGCTCGAGAGTTA  578
TSC1     NM_000368  24475626  7248  D-003028-07  CGACACGGCTGATAACTGA  579
TSC1     NM_000368  24475626  7248  D-003028-08  CGGCTGATGTTGTTAAATA  580
TSC2     NM_000548  10938006  7249  D-003029-05  GCATTAATCTCTTACCATA  581
TSC2     NM_000548  10938006  7249  D-003029-06  CCAATGTCCTCTTGTCTTT  582
TSC2     NM_000548  10938006  7249  D-003029-07  GGAGACACATCACCTACTT  583
TSC2     NM_000548  10938006  7249  D-003029-08  TCACCAGGCTCATCAAGAA  584
XPO1     NM_003400  8051634   7514  D-003030-05  GAAAGTCTCTGTCAAAATA  585
XPO1     NM_003400  8051634   7514  D-003030-06  GCAATAGGCTCCATTAGTG  586
XPO1     NM_003400  8051634   7514  D-003030-07  GGAACATGATCAACTTATA  587
XPO1     NM_003400  8051634   7514  D-003030-08  GGATACAGATTCCATAAAT  588

Gene Name      Acc#       GI        LL     Duplex #     Sequence             SEQ ID
ABL1           NM_007313  6382057   25     D-003100-05  GGAAATCAGTGACATAGTG  5
ABL1           NM_007313  6382057   25     D-003100-06  GGTCCACACTGCAATGTTT  5
ABL1           NM_007313  6382057   25     D-003100-07  GAAGGAAATCAGTGACATA  5
ABL1           NM_007313  6382057   25     D-003100-08  TCACTGAGTTCATGACCTA  5
ABL2           NM_007314  6382061   27     D-003101-05  GAAATGGAGCGAACAGATA  5
ABL2           NM_007314  6382061   27     D-003101-06  GAGCCAAATTTCCTATTAA  5
ABL2           NM_007314  6382061   27     D-003101-07  GTAATAAGCCTACAGTCTA  5
ABL2           NM_007314  6382061   27     D-003101-08  GGAGTGAAGTTCGCTCTAA  5
ACK1           NM_005781  8922074   10188  D-003102-05  AAACGCAAGTCGTGGATGA  5
ACK1           NM_005781  8922074   10188  D-003102-06  GCAAGTCGTGGATGAGTAA  5
ACK1           NM_005781  8922074   10188  D-003102-07  GAGCACTACCTCAGAATGA  5
ACK1           NM_005781  8922074   10188  D-003102-08  TCAGCAGCACCCACTATTA  6
ALK            NM_004304  29029631  238    D-003103-05  GACAAGATCCTGCAGAATA  6
ALK            NM_004304  29029631  238    D-003103-06  GGAAGAGTCTGGCAGTTGA  6
ALK            NM_004304  29029631  238    D-003103-07  GCACGTGGCTCGGGACATT  6
ALK            NM_004304  29029631  238    D-003103-08  GAACTGCAGTGAAGGAACA  6
AXL            NM_021913  21536465  558    D-003104-05  GGTCAGAGCTGGAGGATTT  6
AXL            NM_021913  21536465  558    D-003104-06  GAAAGAAGGAGACCCGTTA  6
AXL            NM_021913  21536465  558    D-003104-07  CCAAGAAGATCTACAATGG  6
AXL            NM_021913  21536465  558    D-003104-08  GGAACTGCATGCTGAATGA  6
BLK            NM_001715  4502412   640    D-003105-05  GAGGATGCCTGCTGGATTT  6
BLK            NM_001715  4502412   640    D-003105-06  ACATGAAGGTGGCCATTAA  6
BLK            NM_001715  4502412   640    D-003105-07  GGTCAGCGCCCAAGACAAG  6
BLK            NM_001715  4502412   640    D-003105-08  GAAACTCGGGTCTGGACAA  6
BMX            NM_001721  21359831  660    D-003106-05  AAACAAACCTTTCCTACTA  6
BMX            NM_001721  21359831  660    D-003106-06  GAAGGAGCATTTATGGTTA  6
BMX            NM_001721  21359831  660    D-003106-07  GAGAAGAGATTACCTTGTT  6
BMX            NM_001721  21359831  660    D-003106-08  GTAAGGCTGTGAATGATAA  6
BTK            NM_000061  4557376   695    D-003107-05  GAACAGGAATGGAAGCTTA  6
BTK            NM_000061  4557376   695    D-003107-06  GCTATGGGCTGCCAAATTT  6
BTK            NM_000061  4557376   695    D-003107-07  GAAAGCAACTTACCATGGT  6
BTK            NM_000061  4557376   695    D-003107-08  GGTAAACGATCAAGGAGTT  6
C20orf64       NM_033550  19923655  11285  D-003108-05  CAACTTAGCCAAGACAATT  6
C20orf64       NM_033550  19923655  11285  D-003108-06  GAAATTGAAGGCTCAGTGA  6
C20orf64       NM_033550  19923655  11285  D-003108-07  TGGAACAGCTGAACATTGT  6
C20orf64       NM_033550  19923655  11285  D-003108-08  GCTTCCAACTGCTTATATA  6
CSF1R          NM_005211  27262658  1436   D-003109-05  GGAGAGCTCTGACGTTTGA  6
CSF1R          NM_005211  27262658  1436   D-003109-06  CAACAACGCTACCTTCCAA  6
CSF1R          NM_005211  27262658  1436   D-003109-07  CCACGCAGCTGCCTTACAA  6
CSF1R          NM_005211  27262658  1436   D-003109-08  GGAACAACCTGCAGTTTGG  6
CSK            NM_004383  4758077   1445   D-003110-05  CAGAATGTATTGCCAAGTA  6
CSK            NM_004383  4758077   1445   D-003110-06  GAACAAAGTCGCCGTCAAG  6
CSK            NM_004383  4758077   1445   D-003110-07  GCGAGTGCCTTATCCAAGA  6
CSK            NM_004383  4758077   1445   D-003110-08  GGAGAAGGGCTACAAGATG  6
DDR1           NM_013994  7669484   780    D-003111-05  GGAGATGGAGTTTGAGTTT  6
DDR1           NM_013994  7669484   780    D-003111-06  CAGAGGCCCTGTCATCTTT  6
DDR1           NM_013994  7669484   780    D-003111-07  GCTGGTAGCTGTCAAGATC  6
DDR1           NM_013994  7669484   780    D-003111-08  TGAAAGAGGTGAAGATCAT  6
DDR2           NM_006182  5453813   4921   D-003112-05  GGTAAGAACTACACAATCA  6
DDR2           NM_006182  5453813   4921   D-003112-06  GAACGAGAGTGCCACCAAT  6
DDR2           NM_006182  5453813   4921   D-003112-07  AOAOGAATCTGAAGTTTAT  6
DDR2           NM_006182  5453813   4921   D-003112-08  CAACAAGAATGCCAGGAAT  6
DKFZp761P1010  NM_018423  8922178   55359  D-003113-05  [illegible]          6
DKFZp761P1010  NM_018423  8922178   55359  D-003113-06  [illegible]          6
DKFZp761P1010  NM_018423  8922178   55359  D-003113-07  OGCAGTAGCTGCACACATA  6
DKFZp761P1010  NM_018423  8922178   55359  D-003113-08  GGTGGTACCTGAACTGTAT  6
EGFR           NM_005228  4885198   1956   D-003114-05  GAAGGAAACTGAATTCAAA  6
EGFR           NM_005228  4885198   1956   D-003114-06  GGAAATATGTACTACGAAA  6
EGFR           NM_005228  4885198   1956   D-003114-07  CCACAAAGCAGTGAATTTA  6
EGFR           NM_005228  4885198   1956   D-003114-08  GTAACAAGCTCACGCAGTT  6
EPHA1          NM_005232  4885208   2041   D-003115-05  GACCAGAGCTTCACCATTC  6
EPHA1          NM_005232  4885208   2041   D-003115-06  GCAAGACTGTGGCCATTAA  6
EPHA1          NM_005232  4885208   2041   D-003115-07  GGGCGAACCTGACCTATGA  6
EPHA1          NM_005232  4885208   2041   D-003115-08  GATTGTAGCCGTCATCTTT  6
EPHA2          NM_004431  4758277   1969   D-003116-05  GGAGGGATCTGGCAACTTG  6
EPHA2          NM_004431  4758277   1969   D-003116-06  GCAGCAAGGTGCACGAATT  6
EPHA2          NM_004431  4758277   1969   D-003116-07  GGAGAAGGATGGCGAGTTC
EPHA2          NM_004431  4758277   1969   D-003116-08  GAAGTTCACTACCGAGATC  6
EPHA3          NM_005233  21361240  2042   D-003117-05  GATCGGACCTCCAGAAATA  6
EPHA3          NM_005233  21361240  2042   D-003117-06  GAACTCAGCTCAGAAGATT  6
EPHA3          NM_005233  21361240  2042   D-003117-07  GCAAGAGGCACAAATGTTA  6
EPHA3          NM_005233  21361240  2042   D-003117-08  GAGCATCAGTTTACAAAGA  6
EPHA4          NM_004438  4758279   2043   D-003118-05  GGTCTGGGATGAAGTATTT  6
EPHA4          NM_004438  4758279   2043   D-003118-06  GAATGAAGTTACCTTATTG  6
EPHA4          NM_004438  4758279   2043   D-003118-07  GAACTTGGGTGGATAGCAA  6
EPHA4          NM_004438  4758279   2043   D-003118-08  GAGATTAAATTCACCTTGA  6
EPHA7          NM_004440  4758281   2045   D-003119-05  GAAAAGAGATGTTGCAGTA  6
EPHA7          NM_004440  4758281   2045   D-003119-06  CTAGATGCCTCCTGTATTA  6
EPHA7          NM_004440  4758281   2045   D-003119-07  AGAAGAAGGTTATCGTTTA  6
EPHA7          NM_004440  4758281   2045   D-003119-08  TAGCAAAGCTGACCAAGAA  6
EPHA8          NM_020526  18201903  2046   D-003120-05  GAAGATGCACTATCAGAAT  6
EPHA8          NM_020526  18201903  2046   D-003120-06  GAGAAGATGCACTATCAGA
EPHA8          NM_020526  18201903  2046   D-003120-07  AACCTGATCTCCAGTGTGA
EPHA8          NM_020526  18201903  2046   D-003120-08  TCTCAGACCTGGGCTATGT
EPHB1          NM_004441  21396502  2047   D-003121-05  GCGATAAGCTCCAGCATTA
EPHB1          NM_004441  21396502  2047   D-003121-06  GAAACGGGCTTATAGCAAA
EPHB1          NM_004441  21396502  2047   D-003121-07  GGATGAAGATCTACATTGA
EPHB1          NM_004441  21396502  2047   D-003121-08  GCACGTCTCTGTCAACATC
EPHB2          NM_017449  17975764  2048   D-003122-05  ACTATGAGCTGCAGTACTA
EPHB2          NM_017449  17975764  2048   D-003122-06  GTACAACGCCACAGCCATA
EPHB2          NM_017449  17975764  2048   D-003122-07  GGAAAGCAATGACTGTTCT
EPHB2          NM_017449  17975764  2048   D-003122-08  CGGACAAGCTGCAACACTA
EPHB3          NM_004443  17975767  2049   D-003123-05  GGTGTGATCTCCAATGTGA
EPHB3          NM_004443  17975767  2049   D-003123-06  GGGATGACCTCCTGTACAA
EPHB3          NM_004443  17975767  2049   D-003123-07  CAGAAGACCTGCTCCGTAT
EPHB3          NM_004443  17975767  2049   D-003123-08  GAGATGAAGTACTTTGAGA
EPHB4          NM_004444  17975769  2050   D-003124-05  GGACAAACACGGACAGTAT
EPHB4          NM_004444  17975769  2050   D-003124-06  GTACTAAGGTCTACATCGA
EPHB4          NM_004444  17975769  2050   D-003124-07  GGAGAGAAGCAGAATATTC
EPHB4          NM_004444  17975769  2050   D-003124-08  GCCAATAGCCACTCTAACA
EPHB6          NM_004445  4758291   2051   D-003125-05  GGAAGTCGATCCTGCTTAT
EPHB6          NM_004445  4758291   2051   D-003125-06  GGACCAAGGTGGACACAAT
EPHB6          NM_004445  4758291   2051   D-003125-07  TGTGGGAAGTGATGAGTTA
EPHB6          NM_004445  4758291   2051   D-003125-08  CGGGAGACCTTCACCCTTT
ERBB2          NM_004448  4758297   2064   D-003126-05  GGACGAATTCTGCACAATG
ERBB2          NM_004448  4758297   2064   D-003126-06  GACGAATTCTGCACAATGG
ERBB2          NM_004448  4758297   2064   D-003126-07  CTACAACACAGACACGTTT
ERBB2          NM_004448  4758297   2064   D-003126-08  AGACGAAGCATACGTGATG
ERBB3          NM_001982  4503596   2065   D-003127-05  AAGAGGATGTCAACGGTTA
ERBB3          NM_001982  4503596   2065   D-003127-06  GAAGACTGCCAGACATTGA
ERBB3          NM_001982  4503596   2065   D-003127-07  GACAAACACTGGTGCTGAT
ERBB3          NM_001982  4503596   2065   D-003127-08  GCAGTGGATTCGAGAAGTG
ERBB4          NM_005235  4885214   2066   D-003128-05  GAGGAAAGATGCCAATTAA
ERBB4          NM_005235  4885214   2066   D-003128-06  GCAGGAAACATCTATATTA
ERBB4          NM_005235  4885214   2066   D-003128-07  GATCACAACTGCTGCTTAA
ERBB4          NM_005235  4885214   2066   D-003128-08  CCTCAAAGATACCTAGTTA
FER            NM_005246  4885230   2241   D-003129-05  GGAGTGACCTGAAGAATTC
FER            NM_005246  4885230   2241   D-003129-06  TAAAGCAGATTCCCATTAA
FER            NM_005246  4885230   2241   D-003129-07  GGAAAGTACTGTCCAAATG
FER            NM_005246  4885230   2241   D-003129-08  GAACAACGGCTGCTAAAGA
FES 














FES 


NM 002005 


13376997 


2242 


D-0031 30-05 


CGAGGATCCTGAAGCAGTA




FES 


NM 002005 


13376997


2242 


D-0031 30-06 


AGGAATACCTGGAGATTAG 




FES 


NM 002005 


13376997 


2242 


D-0031 30-07 


CAACAGGAGCTCCGGAATG




FES 


NM 002005 


13376997 


2242 


D-0031 30-08 


GGTGTTGGGTGAGCAGATT 




FGFR1 














FGFR1 


NM 000604 


13186232 


2260 


D-0031 31 -05 


TAAGAAATGTCTCCTTTGA 




FGFR1 


NM 000604 


13186232 


2260 


D-0031 31 -06 


GAAGACTGCTGGAGTTAAT 
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FGFR1 


NM 000604 


13186232 


2260 


D-003131-07 


GATGGTCCCTTGTATGTCA 


7 


FGFR1 


NM 000604 


13186232 


2260 


D-003131-08 


CTTAAGAAATGTCTCCTTT 


7 


FGFR2 














FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-05 


CCAAATCTCTCAACCAGAA 


7 


FGFR2 


NM_000141 


13186239 


2263 


D-0031 32-06 


GAACAGTATTCACCTAGTT 


7 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-07 


GGCCAACACTGTCAAGTTT


7 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-08 


GTGAAGATGTTGAAAGATG 


7 


FGFR3 














FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-05 


TGTCGGACCTGGTGTCTGA 


7 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-06 


GCATCAAGCTGCGGCATCA 


7 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-07 


GGACGGCACACCCTACGTT


7 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-08 


TGCACAACCTCGACTACTA 


7 


FGFR4 














FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-05 


GCACTGGAGTCTCGTGATG 


7 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-06 


CATAGGGACCTCTCGAATA 


7 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-07 


ATACGGACATCATCCTGTA 


7 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-08 


ATAGGGACCTCTCGAATAG 


7 


FGR 














FGR 


NM 005248 


4885234 


2268 


D-0031 35-05 


GCGATCATGTGAAGCATTA 


7 


FGR 


NM_005248 


4885234 


2268 


D-0031 35-06 


TCACTGAGCTCATCACCAA




FGR 


NM 005248 


4885234 


2268 


D-0031 35-07 


GAAGAGTGGTACTTTGGAA




FGR 


NM 005248 


4885234 


2268 


D-0031 35-08 


CCCAGAAGCTGCCCTCTTT 




FLT1 














FLT1 


NM_002019 


4503748 


2321 


D-0031 36-05 


GAGCAAACGTGACTTATTT




FLT1 


NM_002019 


4503748 


2321 


D-0031 36-06 


CCAAATGGGTTTCATGTTA 




FLT1 


NM 002019 


4503748 


2321 


D-0031 36-07 


CAACAAGGATGCAGCACTA




FLT1 


NM_002019 


4503748 


2321 


D-0031 36-08 


GGACGTAACTGAAGAGGAT 




FLT3 














FLT3 


NM 004119 


4758395 


2322 


D-0031 37-05 


GAAGGCATCTACACCATTA 




FLT3 


NM 004119 


4758395 


2322 


D-0031 37-06 


GAAGGAGTCTGGAATAGAA 




FLT3 


NM 004119 


4758395 


2322 


D-0031 37-07 


GAATTTAAGTCGTGTGTTC 




FLT3 


NM 004119 


4758395 


2322 


D-0031 37-08 


GGAATTCATTTCACTCTGA 




FLT4 














FLT4 


NM 002020 


4503752 


2324 


D-0031 38-05 


GCAAGAACGTGCATCTGTT 




FLT4 


NM 002020 


4503752 


2324 


D-0031 38-06 


GCGAATACCTGTCCTACGA 




FLT4 


NM 002020 


4503752 


2324 


D-0031 38-07 


GAAGACATTTGAGGAATTC 




FLT4 


NM 002020 


4503752 


2324 


D-0031 38-08 


GAGCAGCCATTCATCAACA 




FRK














FRK 


NM 002031 


4503786 


2444 


D-0031 39-05 


GAAACAGACTCTTCATATT 




FRK 


NM 002031 


4503786 


2444 


D-0031 39-06 


GAACAATACCACTCCAGTA 




FRK 


NM 002031 


4503786 


2444 


D-0031 39-07 


CAAGACCGGTTCCTTTCTA




FRK 


NM 002031 


4503786 


2444 


D-0031 39-08 


GCAAGAATATCTCCAAAAT 




FYN 














FYN 


NM 002037 


23510344 


2534 


D-0031 40-05 


GGAATGGACTCATATGCAA 




FYN 


NM 002037 


23510344 


2534 


D-0031 40-06 


GCAGAAGAGTGGTACTTTG 




FYN 


NM 002037 


23510344 


2534 


D-0031 40-07 


CAAAGGAAGTTTACTGGAT 




FYN 


NM 002037 


23510344 


2534 


D-0031 40-08 


GAAGAGTGGTACTTTGGAA




HCK 














HCK 


NM 002110 


4504356 


3055 


D-0031 41 -05 


GAGATACCGTGAAACATTA 




HCK 


NM 002110 


4504356 


3055 


D-0031 41 -06 


GCAGGGAGATACCGTGAAA 




HCK 


NM 002110 


4504356 


3055 


D-0031 41 -07 


CATCGTGGTTGCCCTGTAT 




HCK 


NM 002110 


4504356 


3055 


D-0031 41 -08 


TGTGTAAGATTGCTGACTT 




ITK 














ITK 


NM 005546 


21614549 


3702 


D-0031 44-05 


CAAATAATCTGGAAACCTA 




ITK 


NM 005546 


21614549 


3702 


D-0031 44-06 


GAAGAAACGAGGAATAATA 




ITK 


NM 005546 


21614549 


3702 


D-0031 44-07 


GAAACTCTCTCATCCCAAA 





ITK 


NM 005546 


21614549 


3702 


D-003144-08


GGAATGGGCATGAAGGATA 


7 


JAK1 














JAK1 


NM 002227 


4504802 


3716 


D-003145-05


CCACATAGCTGATCTGAAA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-003145-06


TGAAATCACTCACATTGTA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-003145-07


TAAGGAACCTCTATCATGA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-08 


GCAGGTGGCTGTTAAATCT


7 


JAK2 














JAK2 


NM 004972 


13325062


3717 


D-0031 46-05 


GCAAATAGATCCAGTTCTT 


7 


JAK2 


NM 004972 


13325062


3717 


D-0031 46-06 


GAGCAAAGATCCAAGACTA 


7 


JAK2 


NM 004972 


13325062 


3717 


D-0031 46-07 


GCCAGAAACTTGAAACTTA 


7 


JAK2 


NM 004972 


13325062


3717 


D-0031 46-08 


GTACAGATTTCGCAGATTT 


7 


JAK3 














JAK3 


NM 000215 


4557680 


3718 


D-0031 47-05 


GCGCCTATCTTTCTCCTTT 


7 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-06 


CCAGAAATCGTAGACATTA 


7 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-07 


CCTCATCTCTTCAGACTAT 


7 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-08 


TGTACGAGCTCTTCACCTA 


7 


KDR 














KDR 


NM 002253 


11321596 


3791 


D-0031 48-05 


GGAAATCTCTTGCAAGCTA 


7 


KDR 


NM 002253 


11321596 


3791 


D-0031 48-06 


GATTACAGATCTCCATTTA


7 


KDR 


NM 002253 


11321596 


3791 


D-0031 48-07 


GCAGACAGATCTACGTTTG


7 


KDR 


NM 002253 


11321596 


3791 


D-0031 48-08 


GCGATGGCCTCTTCTGTAA 


7 


KIAA1079














KIAA1079


NM 014916 


7662475 


22853 


D-0031 49-05 


GAAATTCTCTCAACTGATG 


7 


KIAA1079


NM 014916 


7662475 


22853 


D-0031 49-06 


GCAGAGGTCTTCACACTTT


7 


KIAA1079


NM 014916 


7662475 


22853 


D-0031 49-07 


TAAATGATCTTCAGACAGA 


7 


KIAA1079


NM 014916 


7662475 


22853 


D-0031 49-08 


GAGCAGCCCTACTCTGATA


7 


KIT 














KIT 


NM 000222 


4557694 


3815 


D-0031 50-05 


AAACACGGCTTAAGCAATT


7 


KIT 


NM 000222 


4557694 


3815 


D-0031 50-06 


GAACAGAACCTTCACTGAT 


7 


KIT 


NM 000222 


4557694 


3815 


D-0031 50-07 


GGGAAGCCCTCATGTCTGA 


7 


KIT 


NM 000222 


4557694 


3815 


D-0031 50-08 


GCAATTCCATTTATGTGTT 


7 


LCK 














LCK 


NM 005356 


20428651 


3932 


D-0031 51 -05 


GAACTGCCATTATCCCATA 


7 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -06 


GAGAGGTGGTGAAACATTA 


7 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -07 


GGGCCAAGTTTCCCATTAA


7 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -08 


GCACGCTGCTCATCCGAAA 


7 


LTK 














LTK 


NM 002344 


4505044 


4058 


D-003152-05


TGAATTCACTCCTGCCAAT


7 


LTK 


NM_002344


4505044 


4058 


D-0031 52-06 


GTGGCAACCTCAACACTGA 


7 


LTK 


NM 002344 


4505044 


4058 


D-0031 52-07 


GGAGCTAGCTGTGGATAAC 


7 


LTK 


NM 002344 


4505044 


4058 


D-0031 52-08 


GCAAGTTTCGCCATCAGAA 


7 


LYN 














LYN 


NM 002350 


4505054 


4067 


D-0031 53-05 


GCAGATGGCTTGTGCAGAA 


7 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-06 


GGAGAAGGCTTGTATTAGT 


7 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-07 


GATGAGCTCTATGACATTA 


7 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-08 


GGTGCTAAGTTCCCTATTA 


7 


MATK 














MATK 


NM 002378 


21450841 


4145 


D-0031 54-05 


TGAAGAATATCAAGTGTGA 


7 


MATK 


NM 002378 


21450841 


4145 


D-0031 54-06 


CCGCTCAGCTCCTGCAGTT 


7 


MATK 


NM 002378 


21450841 


4145 


D-0031 54-07 


TACTGAACCTGCAGCATTT 


7 


MATK


NM_002378


21450841


4145


D-003154-08


rGGGAGGTCTTCTCATATG 


8 


MERTK 














MERTK 


NM 006343 


5453737 


10461 


D-0031 55-05 


GAACTTACCTTACATAGCT 


8 


MERTK 


NM 006343 


5453737 


10461 


D-0031 55-06 


GGACCTGCATACTTACTTA 


8 


MERTK 


NM 006343 


5453737 


10461 


D-0031 55-07 


TGACAGGAATCTTCTAATT 


8 


MERTK 


NM 006343 


5453737 


10461 


D-0031 55-08 


GGTAATGGCTCAGTCATGA 


8 



MET 














MET 


NM 000245 


4557746 


4233 


D-0031 56-05 


GAAAGAACCTCTCAACATT 


8 


MET 


NM 000245 


4557746 


4233 


D-0031 56-06 


GGACAAGGCTGACCATATG 


8 


MET 


NM 000245 


4557746 


4233 


D-003156-07


CCAATGACCTGCTGAAATT 


8 


MET 


NM 000245 


4557746 


4233 


D-0031 56-08 


GAGCATACATTAAACCAAA


8 


MST1R














MST1R 


NM 002447 


4505264 


4486 


D-0031 57-05 


GGATGGAGCTGCTGGCTTT 


8 


MST1R 


NM 002447 


4505264 


4486 


D-0031 57-06 


CTGCAGACCTATAGATTTA 


8 


MST1R 


NM 002447 


4505264 


4486 


D-0031 57-07 


GCACCTGTCTCACTCTTGA 


8 


MST1R


NM 002447 


4505264 


4486 


D-0031 57-08 


GAAAGAGTCCATCCAGCTA 


8 


MUSK 














MUSK 


NM 005592 


5031926 


4593 


D-0031 58-05 


GAAGAAGCCTCGGCAGATA 


8 


MUSK 


NM 005592 


5031926 


4593 


D-0031 58-06 


GTAATAATCTCCATCATGT 


8 


MUSK 


NM 005592 


5031926 


4593 


D-0031 58-07 


GGAATGAACTGAAAGTAGT 


8 


MUSK 


NM_005592 


5031926 


4593 


D-0031 58-08 


GAGATTTCCTGGACTAGAA 


8 


NTRK1 














NTRK1 


NM 002529 


4585711


4914 


D-0031 59-05 


GGACAACCCTTTCGAGTTC 


8 


NTRK1 


NM 002529 


4585711


4914 


D-0031 59-06 


CCAGTGACCTCAACAGGAA 


8 


NTRK1 


NM_002529 


4585711


4914 


D-0031 59-07 


CCACAATACTTCAGTGATG 


8 


NTRK1 


NM 002529 


4585711


4914 


D-0031 59-08 


GAAGAGTGGTCTCCGTTTC 


8 


NTRK2 














NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-05 


GAACAGAAGTAATGAAATC 


8 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-06 


GTAATGCTGTTTCTGCTTA 


8 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-07 


GCAAGACACTCCAAGTTTG 


8 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-08 


GAAAGTCTATCACATTATC 


8 


NTRK3 














NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -05 


GAGCGAATCTGCTAGTGAA 


8 


NTRK3 


NM_002530 


4505474 


4916 


D-0031 61 -06 


GAAGTTCACTACAGAGAGT 


8 


NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -07 


GGTCGACGGTCCAAATTTG 


8 


NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -08 


GAATATCACTTCCATACAC 


8 


PDGFRA 














PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-05 


GAAACTTCCTGGACTATTT


8 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-06 


GAGATHTGGTCAACTATTT


8 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-07 


GCACGCCGCTTCCTGATAT 


8 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-08 


CATCAGAGCTGGATCTAGA


8 


PDGFRB 














PDGFRB 


NM 002609 


15451788 


5159 


D-0031 63-05 


GAAAGGAGACGTCAAATAT 


8 


PDGFRB 


NM_002609


15451788 


5159 


D-0031 63-06 


GGAATGAGGTGGTCAACTT


8 


PDGFRB 


NM_002609


15451788 


5159 


D-0031 63-07 


CAACGAGTCTCCAGTGCTA 


8 


PDGFRB 


NM 002609 


15451788 


5159 


D-0031 63-08 


GAGAGGACCTGCCGAGCAA 


8 


PTK2 














PTK2 


NM 005607 


27886592 


5747 


D-0031 64-05 


GAAGTTGGGTTGTCTAGAA 


8 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-06 


GAAGAACAATGATGTAATC 


8 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-07 


GGAAATTGCTTTGAAGTTG 


8 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-08 


GGTTCAAGCTGGATTATTT


8 


PTK2B 














PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-05 


GAACATGGCTGACCTCATA 


8 


PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-06 


GGACCACGCTGCTCTATTT 


8 


PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-07 


GGACGAGGACTATTACAAA


8 


PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-08 


TGGCAGAGCTCATCAACAA 


8 


PTK6














PTK6 


NM_005975


27886594 


5753 


D-0031 66-05 


GAGAAAGTCCTGCCCGTTT 


8 


PTK6 


NM_005975 


27886594 


5753 


D-0031 66-06 


TGAAGAAGCTGCGGCACAA 


8 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-07 


CCGCGACTCTGATGAGAAA 


8 


PTK6 


NM_005975 


27886594 


5753 


D-0031 66-08 


TGCCCGAGCTTGTGAACTA 


8 


PTK7 















PTK7 


NM 002821 


27886610 


5754 


D-0031 67-05 


GAGAGAAGCCCACTATTAA 


8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-06 


CGAGAGAAGCCCACTATTA 


8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-07 


GGAGGGAGTTGGAGATGTT 


8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-08 


GAAGACATGCCGCTATTTG 


8 


PTK9 














PTK9 


NM 002822 


4506274 


5756 


D-0031 68-05 


GAAGAACTACGACAGATTA 


8 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-09 


GAAGGAGACTATTTAGAGT


8 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-10 


GAGCGGATGCTGTATTCTA 


8 


PTK9 


NM_002822 


4506274 


5756 


D-0031 68-11 


CTGCAGACTTCCTTTATGA 


8 


PTK9L 














PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-05 


AGAGAGAGCTCCAGCAGAT 


8 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-06 


TTAACGAGGTGAAGACAGA


8 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-07 


ACACAGAGCCCACGGATGT 


8 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-08 


GCTGGGATCAGGACTATGA 


8 


RET 














RET 


NM 000323 


21536316 


5979 


D-0031 70-05 


GCAAAGACCTGGAGAAGAT 


8 


RET 


NM 000323 


21536316 


5979 


D-0031 70-06 


GCACACGGCTGCATGAGAA 


8


RET 


NM 000323 


21536316 


5979 


D-0031 70-07 


GAACTGGCCTGGAGAGAGT 


8 


RET 


NM_000323 


21536316 


5979 


D-0031 70-08 


TTAAATGGATGGCAATTGA 


8 


ROR1 














ROR1 


NM 005012 


4826867 


4919 


D-0031 71 -05 


GCAAGCATCTTTACTAGGA 


8 


ROR1 


NM 005012 


4826867 


4919 


D-0031 71 -06 


GAGCAAGGCTAAAGAGCTA 


8 


ROR1 


NM 005012 


4826867 


4919 


D-0031 71 -07 


GAGAGCAACTTCATGTAAA 


8 


ROR1 


NM 005012 


4826867 


4919 


D-0031 71 -08 


GAGAATGTCCTGTGTCAAA 


8 


ROR2














ROR2 


NM 004560 


19743897 


4920 


D-0031 72-05 


GGAACTCGCTGCTGCCTAT 


8 


ROR2 


NM_004560 


19743897 


4920 


D-0031 72-06 


GCAGGTGCCTCCTCAGATG 


8 


ROR2 


NM 004560 


19743897 


4920 


D-0031 72-07 


GCAATGTGCTAGTGTACGA 


8 


ROR2


NM_004560 


19743897 


4920 


D-0031 72-08 


GAAGACAGAATATGGTTCA 


8 


ROS1 














ROS1 


NM 002944 


19924164 


6098 


D-0031 73-05 


GAGGAGACCTTCTTACTTA 


£ 


ROS1 


NM 002944 


19924164 


6098 


D-0031 73-06 


TTACAGAGGTTCAGGATTA


£ 


ROS1 


NM 002944 


19924164 


6098 


D-0031 73-07 


GAACAAACCTAAGCATGAA 


£ 


ROS1 


NM 002944 


19924164 


6098 


D-0031 73-08 


GAAAGAGCACTTCAAATAA 


£ 


RYK 














RYK 


NM 002958 


11863158 


6259 


D-0031 74-05 


GAAAGATGGTTACCGAATA 


£ 


RYK 


NM 002958 


11863158 


6259 


D-0031 74-06 


CAAAGTAGATTCTGAAGTT 


£ 


RYK 


NM_002958


11863158 


6259 


D-0031 74-07 


TCACTACGCTCTATCCTTT 


£ 


RYK 


NM 002958 


11863158 


6259 


D-0031 74-08 


GGTGAAGGATATAGCAATA 


£ 


SRC 














SRC 


NM 005417 


21361210 


6714 


D-0031 75-05 


GAGAACCTGGTGTGCAAAG 


£ 


SRC 


NM 005417 


21361210 


6714 


D-0031 75-09 


GAGAGAACCTGGTGTGCAA 


£ 


SRC 


NM 005417 


21361210 


6714 


D-0031 75-10 


GGAGTTTGCTGGACTTTCT 


£ 


SRC 


NM 005417 


21361210 


6714 


D-0031 75-11 


GAAAGTGAGACCACGAAAG 


£ 


SYK 














SYK 


NM 003177 


21361552 


6850 


D-0031 76-05 


GGAATAATCTCAAGAATCA 


£ 


SYK 


NM 003177 


21361552 


6850 


D-0031 76-06 


GAACTGGGCTCTGGTAATT 


£ 


SYK 


NM 003177 


21361552 


6850 


D-0031 76-07 


GGAAGAATCTGAGCAAATT 


£ 


SYK 


NM 003177 


21361552 


6850 


D-0031 76-08 


GAACAGACATGTCAAGGAT 


£ 


TEC 














TEC 


NM 003215 


4507428 


7006 


D-0031 77-05 


GAAATTGTCTAGTAAGTGA 


£ 


TEC 


NM 003215 


4507428 


7006 


D-0031 77-06 


CACCTGAAGTGTTTAATTA


£ 


TEC 


NM 003215 


4507428 


7006 


D-0031 77-07 


GTACAAAGTCGCAATCAAA 


£ 


TEC 


NM 003215 


4507428 


7006 


D-0031 77-08 


TGGAGGAGATTCTTATTAA 


£ 


TEK 














TEK 


NM 000459 


4557868 


7010 


D-0031 78-05 


GAAAGAATATGCCTCCAAA 


i 



TEK


NM_000459


4557868


7010


D-003178-06


OO MM I OMOM I UrtrtM I I I 


Q 
O 


TEK


NM_000459


4557868


7010


D-003178-07


T(^A AnTAPPTCATATTPTA 
I OrtrVJ i MOO I OM I M 1 1 Oln 


Q 
O 


TEK


NM_000459


4557868


7010


D-003178-08


a a A^i Ap , r*TAr v r^Trti a ATA 

UUAArtuMUU 1 AUu 1 OMM 1 M 


O 
O 


TIP 














TIP 
l it: 


MM C\C\RAOA 
INIVI 111104/^ 




/U/O 


u-uuo 1 /y-uo 


f^Af^ Aflir^ A^5^TTTATP;T^ A A 1 
oMoAvjoAoo 1 1 1 M 1 vj 1 oMM 


Q 
O 


TIP 


mm nn^49A 

INIVI UUO^fZ^f 


400000U 


707^ 


n nn^i 7Q or 
u-uuo i 1 y-uu 


fi^PlAPA^PPTPTAPPPTTA 
OUoMLtMuUU 1 0 1 AuUU 1 1 M 


O 


TIP 
1 IlZ 


INIVI UU04Z4 


400000U 


7n7^ 


n nn^i 7Q n7 
u-uuo 1 /y-u/ 


riA A CTTPTfiT^ pAA ATTf^ 
UMMO 1 bUMrtrt 1 1 OvJ 


Q 
O 


TIP 
1 It: 


INIVI 


^fooODOU 


/U/O 


n 00^*1 7Q HA 

u-uuo 1 / y-uo 


PAAPAT^^PPTPAf^AAPT^ 
LvMMOM 1 VjjvjOO 1 UAoMAu 1 \J 


n 

y 


1 Nr\ 1 














TMl^i 
1 INr\ 1 


mm nn^QR^ 
inivi uuoyoo 


40U / Q IU 


£71 1 
O/ I I 


u~uuo I ou-uo 


pTTPTn r» r*T a a htpt a a 

O 1 IUI V3UV3UU 1 MM\J3 1 V_/ 1 MM 


Q 


1 INI\ 1 


mm nn^QQi; 

inivi uuoyoo 


4-OU / D I U 


07H H 
0/ 1 1 


U-UUO I ou-uo 


/tt A APTCririTPTAPA A n A TO 


O 


1 NIX 1 


mm nniQQj; 

inivi uuoyoo 


H-OU / D I U 


0/ 1 1 


U-UUO I oU-U / 


A ^ A f2(~±T ATP^PTP ATP A 
LrbAbAbb 1 A 1 Uob 1 OA 1 oA 


o 


1 INI\ 1 


MM PiriQOQK 

inivi uuoyoo 


4-OU / O I U 


Q7i H 
O/ i 1 


U-UUO 1 0U-U0 


(Opr^rATprTPPApr'ATTA 
bbUbUA 1 1 bbAbUA 1 IA 




TVl/ 

1 Arv 














1 An*. 


mm nn^QOft 
INIVI_UUOOZo 


A KC\77A O 


1 zy^ 


u-uuo 1 0 1 -uo 


PAAPATPTATTPACAPAAP 
bAAUA 1 O 1 A 1 1 bAoAbAAo 


c 


TVl/ 
1 Arx 


mm nm^oft 

INIVI UUOOZo 


HO\J f I 4Z 


700A 

f zy*+ 


U-UUO 1 0 I -Uo 


TP A APPPAPTTTATP ATTT 

1 UAAbbbAU 1 1 1 A 1 bA 1 1 1 


c 


TVl/ 
1 Ar\ 


mm nriQQOQ 
INIVI UUOOZo 


A tZC\77A1 


/ zy^- 


pi nnn ft-i f\7 
U-UUo 1 0 I -U / 


Pif^Af^A^PiA ATPPPTATATT 

bbAbAbbAA 1 ooO I A 1 A 1 1 


c 


TYl/ 
1 Ar\ 


mm nn^QOft 

INIVI UUOOZo 


HOV 1 I 4Z 


( zy^ 


U-UUO 1 O 1 -UO 


nn ATATATPTP A APPA ATP 

o<oA 1 A 1 A 1 o I bAAbbAA 1 o 


c 


TVl/9 
1 Y r\Z 














1 Y r\Z 


Mm nn^QQi 
INIVI UUooo 1 


AKC\77AQ. 


7007 

. / zy / 


U-UUO 1 0Z-U0 


PAPPAP ATPPAPPAPTTTA 
VjAVjVjAvjA 1 OOAOOAO 1 1 1 A 




TVl/O 

I Yr\Z 


mm nn^Q'j'i 
INIVI UUOOO 1 


A CA77/1 Q 

4DU / / 4o 


70Q7 

/ zy / 


U-UUO 1 oZ-UO 


PPATrPArATTPPArATA A 
bUA 1 bbAUA 1 1 bUAUA 1 AA 


c 


TVl/O 
1 Yr\Z 


mm nnQQQ'i 
NM UUOOO 1 


A CH77/I Q 


7007 

/ zy / 


pi nnQ-i qo n7 
U-UUo 1 oZ-U 1 


TO A A ATAPPTAPPPAPAPT 

1 UAAA 1 AUU 1 AbbUAbAb 1 




1 Yr\Z 


INIVI UUOOO 1 


A CA77>1 Q 

4DU / / ho 


7007 

/ zy 1 


pi nnn qo no 
U-UUol 0Z-U0 


P* A ATPTTPPTP A PPTP1 — TO 

bAA loll VjjO 1 bAUb Ibl lb 




1 YKUo 














I YKUo 


NM UUoZyo 


07C07f\77 


( oUI 


U-UUol 00-Uo 


OOT AO A A OOTOTOOO ATTT 
VJVJ 1 AUAAUCj 1 KJ 1 bbbA 1 1 1 


r 


1 YKUo 


NM UUozyo 


07C07H77 

zY oy/U/ / 


/oUI 


U-UUol OO-UO 


A OOOTO A O AT — i — T A O A A PT A 

AoCso 1 oAvjA 1 1 1 AoAAo 1 A 


r 


1 YKUo 


NM UUozyo 


07C07A77 

ZYOyYU/ ( 


/OU1 


U-UUol oo-U/ 


r^r^ atppptppi — nrPTP a a a 
bbA 1 CdCpO 1 oo 1 II Cj 1 bAAA 




1 YKUo 


NM UUozyo 


ZYOy/U/ / 


7301 


D-0031 83-08 


O AP A OP* A APTAPP A AP ATP* 

UACjACjoAAo 1 AUChAACjA 1 o 




YES1 














YES1 


NM 005433 


21071041 


7525 


D-0031 84-05 


GAAGGACCCTGATGAAAGA 


< 


YES1 


NM 005433 


21071041 


7525 


D-0031 84-06 


TAAGAAGGGTGAAAGATTT 


C 


YES1 


NM 005433 


21071041 


7525 


D-0031 84-07 


TCAAGAAGCTCAGATAATG




YES1 


NM 005433 


21071041 


7525


D-003184-08


CAGAATCCCTGCATGAATT 


i 


Table VIII 



Gene 
Name 


Acc#


GI


Locus 
Link 


Duplex # 


Full Sequence 


SEQ. ID
NO.


APC2 














APC2 


NM 013366 


7549800 


29882 


D-003200-05 


GCAAGGACCTCTTCATCAA 


921 


APC2 


NM 013366 


7549800 


29882 


D-003200-06 


GAGAAGAAGTCCACACTAT 


922 


APC2 


NM 013366 


7549800 


29882 


D-003200-07 


GGAATGCCATCTCCCAATG 


923 


APC2 


NM 013366 


7549800 


29882 


D-003200-09 


CAACACGTGTGACATCATC 


924 


ATM 














ATM 


NM 000051 


20336202 


472 


D-003201-05 


GCAAGCAGCTGAAACAAAT


925 


ATM 


NM 000051 


20336202 


472 


D-003201-06 


GAATGTTGCTTTCTGAATT 


926 


ATM 


NM 000051 


20336202 


472 


D-003201-07 


GACCTGAAGTCTTATTTAA 


927 


ATM 


NM 000051 


20336202 


472 


D-003201-08 


AGACAGAATTCCCAAATAA 


928 


ATR 














ATR 


NM 001184 


20143978 


545 


D-003202-05 


GAACAACACTGCTGGTTTG


929 


ATR 


NM 001184 


20143978 


545 


D-003202-06 


GAAGTCATCTGTTCATTAT 


930 


ATR 


NM 001184 


20143978 


545 


D-003202-07 


GAAATAAGGTAGACTCAAT 


931 


ATR 


NM 001184 


20143978 


545 


D-003202-08 


CAACATAAATCCAAGAAGA 


932 


BTAK 
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D 1 Af\ 


NM 003600 


3213196 


6790 


D-003545-04 


CAAAGAATCAGCTAGCAAA


933 


BTAK


NM_003600


3213196 


6790 


D-003203-05 


GAAGAGAGTTATTCATAGA 


934 


BTAK


NM 003600 


3213196 


6790 


D-003203-07 


CAAATGCCCTGTCTTACTG 


935 


BTAK


NM_003600


3213196


6790 


D-003203-09 


TCTCGTGACTCAGCAAATT 


936 


CCNA1
















NM 003914 


16306528


8900 


D-003204-05 


GAACCTGGCTAAGTACGTA 


937 


CCNA1


NM_003914


16306528


8900 


D-003204-06 


GCAGATCCATTCTTGAAAT 


938 


CCNA1 


NM_003914


16306528


8900 


D-003204-07 


TCACAAGAATCAGGTGTTA


939 


CCNA1


NM_003914


16306528


8900 


D-003204-08 


CATAAAGCGTACCTTGATA 


940 


CCNA2














CCNA2


NM_001237


16950653 


890 


D-003205-05 


GCTGTGAACTACATTGATA 


941 


CCNA2


NM_001237


16950653


890 


D-003205-06 


GATGATACCTACACCAAGA 


942 


CCNA2


NM 001237 


16950653 


890 


D-003205-07 


GCTGTTAGCCTCAAAGTTT 


943 


CCNA2


NM_001237


16950653


890 


D-003205-08 


AAGCTGGCCTGAATCATTA


944 


CCNB1














CCNB1


NM_031966


14327895 


891 


D-003206-05 


CAACATTACCTGTCATATA 


945 


CCNB1


NM_031966


14327895 


891 


D-003206-06 


CCAAATACCTGATGGAACT


946


CCNB1 


NM_031966


14327895 


891 


D-003206-07 


GAAATGTACCCTCCAGAAA 


947 


CCNB1


NM 031966 


14327895 


891 


D-003206-08 


GCACCTGGCTAAGAATGTA


948 


CCNB2














CCNB2


NM 004701 


10938017 


9133 


D-003207-05 


CAACAAATGTCAACAAACA 


949 


CCNB2


NM 004701 


10938017 


9133 


D-003207-06 


GCAGCAAACTCCTGAAGAT 


950 


CCNB2 


NM_004701


10938017 


9133 


D-003207-07 


CCAGTGATTTGGAGAATAT 


951 


CCNB2 


NM_004701


10938017 


9133 


D-003207-08 


GTGACTACGTTAAGGATAT 


952 


CCNB3 














CCNB3


NM_033031


14719419 


85417 


D-003208-05 


TGAACAAACTGCTGACTTT


953 


CCNB3 


NM 033031 


14719419 


85417 


D-003208-06 


GCTAGCTGCTGCCTCCTTA 


954 


CCNB3 


NM 033031 


14719419 


85417 


D-003208-07 


CAACTCACCTCGTGTGGAT 


955 


CCNB3


NM 033031 


14719419 


85417 


D-003208-08 


GTGGATCTCTACCTAATGA 


956 


CCNC














CCNC


NM_005190


7382485 


892 


D-003209-05 


GCAGAGCTCCCACTATTTG 


957 


CCNC


NM_005190


7382485 


892 


D-003209-06 


GGAGTAGTTTCAAATACAA 


958 


CCNC


NM 005190 


7382485 


892 


D-003209-07 


GACCTTTGCTCCAGTATGT 


959 


CCNC


NM_005190


7382485 


892 


D-003209-08 


GAGATTCTATGCCAGGTAT 


960 


CCND1














CCND1 


NM 053056 


16950654


595


D-003210-05


TGAACAAGCTCAAGTGGAA 


961 


CCND1 


NM_053056


16950654


595


D-003210-06


CCAGAGTGATCAAGTGTGA 


962 


CCND1


NM_053056


16950654


595


D-003210-07


GTTCGTGGCCTCTAAGATG 


963 


CCND1 


NM_053056


16950654


595


D-003210-08


CCGAGAAGCTGTGCATCTA 


964 


CCND2














CCND2


NM_001759


16950656


894 


D-003211-06 


TGAATTACCTGGACCGTTT 


965 


CCND2


NM_001759


16950656 


894 


D-003211-07 


CGGAGAAGCTGTGCATTTA


966 


CCND2


NM_001759


16950656


894


D-003211-08


CTACAGACGTGCGGGATAT 


967 


CCND2


NM_001759


16950656


894 


D-003211-09 


CAACACAGACGTGGATTGT 


968 


CCND3














CCND3


NM_001760


16950657


896


D-003212-05


GGACCTGGCTGCTGTGATT 


969 


CCND3


NM_001760


16950657


896


D-003212-06


GATTATACCTTTGCCATGT


970 


CCND3 


NM_001760


16950657 


896 


D-003212-07 


GACCAGCACTCCTACAGAT 


971 


CCND3


NM_001760


16950657


896


D-003212-08


TGCGGAAGATGCTGGCTTA


972 


CCNE1














CCNE1 


NM 001238 


17318558


898


D-003213-05


AP~TP A PPTPPPP a A a ~r a 

yj I AO | bAbO 1 C^CjCCAAA PA 


973 


CCNE1 


NM 001238 


17318558 


898 


D-0032 13-06 


GGAAATCTATCCTCCAAAG 


974 


CCNE1 


NM 001238 


17318558


898 


D-003213-07 


GGAGGTGTGTGAAGTCTAT 


975 


CCNE1 


NM 001238 


17318558


898 


D-0032 13-08 


CTAAATGACTTACATGAAG 


976 


CCNE2 














CCNE2 


NM 057749 


17318564 


9134 


D-0032 14-05 


GGATGGAACTCATTATATT


977 
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CCNE2 


NM 057749 


17318564 


9134 


D-0032 14-06 


GCAGATATGTTCATGACAA 


978 


CCNE2 


NM_057749


17318564 


9134 


D-003214-07 


CATAATATCCAGACACATA 


979 


CCNE2 


NM 057749 


17318564 


9134 


D-003214-08 


TAAGAAAGCCTCAGGTTTG 


980 


CCNF 














CCNF 


NM_001761


4502620 


899 


D-003215-05 


TCACAAAGCATCCATATTG 


981 


CCNF 


NM 001761 


4502620 


899 


D-0032 15-06 


GAAGTCATGTTTACAGTGT 


982 


CCNF 


NM 001761 


4502620 


899 


D-003215-07 


TAGCCTACCTCTACAATGA 


983 


CCNF 


NM 001761 


4502620 


899 


D-0032 15-08 


GCACCCGGTTTATCAGTAA 


984 


CCNG1 














CCNG1 


NM 004060 


8670528 


900 


D-0032 16-05 


GATAATGGCCTCAGAATGA 


985 


CCNG1 


NM 004060 


8670528 


900 


D-003216-06 


GCACGGCAATTGAAGCATA


986 


CCNG1 


NM 004060 


8670528 


900 


D-0032 16-07 


GGAATAGAATGTCTTCAGA 


987 


CCNG1 


NM 004060 


8670528 


900 


D-0032 16-08 


TAACTCACCTTCCAACAAT 


988 


CCNG2 














CCNG2 


NM 004354 


4757935 


901 


D-0032 17-05 


GGAGAGAGTTGGTTTCTAA 


989 


CCNG2 


NM 004354 


4757935 


901 


D-003217-06 


GGTGAAACCTAAACATTTG 


990 


CCNG2 


NM 004354 


4757935 


901 


D-003217-07 


GAAATACTGAGCCTTGATA 


991 


CCNG2 


NM 004354 


4757935 


901 


D-0032 17-08 


TGCCAAAGTTGAAGATTTA 


992 


CCNH 














CCNH 


NM 001239 


17738313 


902 


D-0032 18-05 


GCTGATGACTTTCTTAATA 


993 


CCNH 


NM 001239 


17738313 


902 


D-003218-06 


CAACTTAATTTCCACCTTA 


994 


CCNH 


NM 001239 


17738313 


902 


D-003218-07 


ATACACACCTTCCCAAATT 


995 


CCNH 


NM 001239 


17738313 


902 


D-0032 18-08 


GCTATGAAGATGATGATTA 


996 


CCNI 














CCNI 


NM 006835 


17738314 


10983 


D-003219-05 


GCAAGCAGACCTCTACTAA 


997 


CCNI 


NM 006835 


17738314 


10983 


D-003219-07 


TGAGAGAATTCCAGTACTA 


998 


CCNI 


NM_006835


17738314 


10983 


D-003219-08 


GGAATCAAACGGCTCTATA 


999 


CCNI 


NM 006835 


17738314 


10983 


D-0032 19-09 


GAATTGGGATCTTCACACA 


1000 


CCNT1 














CCNT1 


NM 001240 


17978465 


904 


D-003220-05 


TATCAACACTGCTATAGTA 


1001 


CCNT1 


NM 001240 


17978465 


904 


D-003220-06 


GAACAAACGTCCTGGTGAT 


1002 


CCNT1 


NM 001240 


17978465 


904 


D-003220-07 


GCACAAGACTCACCCATCT 


1003 


CCNT1 


NM 001240 


17978465 


904 


D-003220-08 


GCACAGACTTCTTACTTCA


1004


CCNT2A 














CCNT2A 


NM 001241 


17978467 


905 


D-003221-05 


GCACAGACATCCTATTTCA 


1005 


CCNT2A 


NM 001241 


17978467


905 


D-003221-06 


GCAGGGACCTTCTATATCA 


1006 


CCNT2A 


NM 001241 


17978467


905 


D-003221-07 


GAACAGCTATATTCACAGA


1007 


CCNT2A 


NM_001241 


17978467


905


D-003221-09 


TTATATAGCTGCCGAGGTA 


1008 


CCNT2B 














CCNT2B 


NM 058241 


17978468


905 


D-003222-05 


GCACAGACATCCTATTTCA 


1009 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-06 


GCAGGGACCTTCTATATCA 


1010 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-07 


GAACAGCTATATTCACAGA


1011 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-08 


GGTGAAATGTACCCAGTTA 


1012 


CDC16














CDC16 


NM 003903 


14110370 


8881 


D-003223-05 


GTAGATGGCTTGCAAGAGA 


1013 


CDC16 


NM 003903 


14110370 


8881 


D-003223-06 


TAAAGTAGCTTCACTCTCT 


1014 


CDC16 


NM 003903 


14110370 


8881 


D-003223-07 


GCTACAAGCTTACTTCTGT 


1015 


CDC16 


NM 003903 


14110370 


8881 


D-003223-08 


TGGAAGAGCCCATCAATAA 


1016 


CDC2 














CDC2 


NM 033379 


27886643 


983 


D-003552-01 


GTACAGATCTCCAGAAGTA 


1017 




NM_033379


27886643


983


D-003552-02


GATCAACTCTTCAGGATTT


1018 


CDC2 


NM 033379 


27886643 


983 


D-003552-03 


GGTTATATCTCATCTTTGA 


1019 


CDC2 


NM 033379 


27886643 


983 


D-003552-04 


GAACTTCGTCATCCAAATA 


1020 


CDC20 














CDC20 


NM 001255 


4557436 


991 


D-003225-05 


GGGAATATATATCCTCTGT 


1021 


CDC20 


NM 001255 


4557436 


991 


D-003225-06 


GAAACGGCTTCGAAATATG 


1022 
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CDC20 


NM 001255 


4557436 


991 


D-003225-07 


GAAGACCTGCCGTTACATT 


1023 


CDC20 


NM 001255 


4557436 


991 


D-003225-08 


CACCAGTGATCGACACATT 


1024 


CDC25A 














CDC25A 


NM 001789 


4502704 


993 


D-003226-05 


GAAATTATGGCATCTGTTT 


1025 


CDC25A 


NM 001789 


4502704 


993 


D-003226-06 


TACAAGGAGTTCTTTATGA 


1026 


CDC25A 


NM 001789 


4502704 


993 


D-003226-07 


CCACGAGGACTTTAAAGAA 


1027 


CDC25A 


NM 001789 


4502704 


993 


D-003226-08 


TGGGAAACATCAGGATTTA 


1028 


CDC25B 














CDC25B 


NM 004358 


11641416 


994 


D-003227-05 


GCAGATACCCCTATGAATA 


1029 


CDC25B 


NM 004358 


11641416 


994 


D-003227-06 


CTAGGTCGCTTCTCTCTGA 


1030 


CDC25B 


NM 004358 


11641416 


994 


D-003227-07 


GAGAGCTGATTGGAGATTA


1031 


CDC25B 


NM 004358 


11641416 


994 


D-003227-08 


AAAAGGACCTCGTCATGTA 


1032 


CDC25C 














CDC25C 


NM 001790 


12408659 


995


D-003228-05 


GAGCAGAAGTGGCCTATAT 


1033 


CDC25C 


NM 001790 


12408659 


995 


D-003228-06 


CAGAAGAGATTTCAGATGA 


1034 


CDC25C 


NM 001790 


12408659 


995 


D-003228-07 


CCAGGGAGCCTTAAACTTA 


1035 


CDC25C 


NM 001790 


12408659 


995 


D-003228-08 


GAAACTTGGTGGACAGTGA 


1036 


CDC27 














CDC27 


NM 001256 


16554576 


996 


D-003229-06 


CATGCAAGCTGAAAGAATA


1037 


CDC27 


NM 001256 


16554576


996 


D-003229-07 


CAACACAAGTACCTAATCA 


1038 


CDC27 


NM 001256 


16554576 


996 


D-003229-08 


GGAGATGGATCCTATTTAC


1039 


CDC27 


NM 001256 


16554576 


996 


D-003229-09 


GAAAAGCCATGATGATATT 


1040 


CDC34 














CDC34 


NM 004359 


16357476 


997 


D-003230-05 


GCTCAGACCTCTTCTACGA 


1041 


CDC34 


NM 004359 


16357476 


997 


D-003230-06 


GGACGAGGGCGATCTATAC 


1042 


CDC34 


NM 004359 


16357476 


997 


D-003230-07 


GATCGGGAGTACACAGACA 


1043 


CDC34 


NM 004359 


16357476 


997 


D-003230-08 


TGAACGAGCCCAACACCTT


1044 


CDC37 














CDC37 


NM 007065 


16357478 


11140 


D-003231-05 


GCGAGGAGACAGCCAATTA 


1045 


CDC37 


NM 007065 


16357478 


11140 


D-003231-06 


CACAAGACCTTCGTGGAAA 


1046 


CDC37 


NM 007065 


16357478 


11140 


D-003231-07 


ACAATCGTCATGCAATTTA 


1047 


CDC37 


NM 007065 


16357478 


11140 


D-003231-08 


GAGGAGAAATGTGCACTCA 


1048 


CDC45L 














CDC45L 


NM 003504 


34335230 


8318 


D-003232-05 


GCACACGGATCTCCTTTGA


1049 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-06 


GCAAACACCTGCTCAAGTC 


1050 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-07 


TGAAGAGTCTGCAAATAAA 


1051 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-08 


GGACGTGGATGCTCTGTGT 


1052 


CDC6 












CDC6 


NM 001254 


16357469 


990


D-003233-05 


GAACACAGCTGTCCCAGAT 


1053 


CDC6 


NM 001254 


16357469 


990 


D-003233-06 


GAGCAGAGATGTCCACTGA 


1054 


CDC6 


NM 001254 


16357469 


990 


D-003233-07 


GGAAATATCTTAGCTACTG 


1055 


CDC6 


NM 001254 


16357469 


990 


D-003233-08 


GGACGAAGATTGGTATTTG 


1056 


CDC7 














CDC7 


NM 003503 


11038647 


8317 


D-003234-05 


GGAATGAGGTACCTGATGA 


1057 


CDC7 


NM 003503 


11038647 


8317 


D-003234-06 


CAGGAAAGGTGTTCACAAA


1058 


CDC7 


NM 003503 


11038647 


8317 


D-003234-07 


CTACACAAATGCACAAATT


1059 


CDC7 


NM 003503 


11038647


8317 


D-003234-08 


GTACGGGAATATATGCTTA 


1060 


CDK10 














CDK10 


NM 003674 


32528262 


8558 


D-003235-05 


GAACTGCTGTTGGGAACCA 


1061 


CDK10 


NM 003674 


32528262 


8558 


D-003235-06 


GGAAGCAGCCCTACAACAA 


1062 


CDK10


NM_003674


32528262


8558


D-003235-07


GCACGCCCAGTGAGAACAT 


1063 


CDK10 


NM 003674 


32528262 


8558 


D-003235-08 


GGAAGCAGCCCTACAACAA 


1064 


CDK2 














CDK2 


NM 001798 


16936527 


1017 


D-003236-05 


GAGCTTAACCATCCTAATA 


1065 


CDK2 


NM 001798 


16936527 


1017 


D-003236-06 


GAGCTTAACCATCCTAATA 


1066 


CDK2 


NM 001798 


16936527 


1017 


D-003236-07 


GTACCGAGCTCCTGAAATC 


1067 
CDK2


NM_001798


16936527


1017


D-003236-08


rti A a a ri ri T O. T r* a ptt a a 


1068


CDK3














CDK3


NM_001258


A CCTyl QQ 
400/ 400 


1018
D-003295-08


GGAAACCACTATAGGCAAT


1303 


RAD9A


NM_004584


19924112 


5883 


D-003295-09 


CGGACGACTTTGCCAATGA 


1304 


RB1 














RB1 


NM_000321


19924112


5925 


D-003296-05 


GAAAGGACATGTGAACTTA 


1305 


RB1


NM_000321


19924112


5925 


D-003296-06 


GAAGAAGTATGATGTATTG 


1306 


RB1 


NM 000321 


4506434 


5925 


D-003296-07 


GAAATGACTTCTACTCGAA 


1307 


RB1 


NM 000321 


4506434 


5925 


D-003296-08 


GGAGGGAACATCTATATTT


1308 


RBBP2 














RBBP2 


NM 005056 


4826967 


5927 


D-003297-05 


CAAAGAAGCTGAATAAACT


1309 


RBBP2 


NM 005056 


4826967 


5927 


D-003297-06


CAACACATATGGCGGATTT


1310 


RBBP2 


NM 005056 


4826967 


5927 


D-003297-07 


GGACAAACCTAGAAAGAAG 


1311 


RBBP2 


NM 005056 


4826967 


5927 


D-003297-08 


GAAAGGCACTCTCTCTGTT 


1312 


RBL1














RBL1


NM 002895 


34577078 


5933 


D-003298-05 


CAAGAGAAGTTGTGGCATA 


1313 


RBL1


NM_002895


34577078 


5933 


D-003298-06 


CAGCAGCACTCCATTTATA 


1314 


RBL1


NM 002895 


34577078 


5933 


D-003298-07 


ACAGAAAGGTCTATCATTT 


1315 


RBL1


NM_002895


34577078 


5933 


D-003298-08 


GGACATAAAGTTACAATTC 


1316 


RBL2














RBL2


NM 005611 


21361291 


5934 


D-003299-05 


GAGCAGAGCTTAATCGAAT 


1317 


RBL2


NM_005611


21361291 


5934 


D-003299-06 


GAGAAXAGCCCTTGTGTGA 


1318 


RBL2


NM 005611 


21361291 


5934 


D-003299-07 


GGACTTAGTTTATGGAAAT 


1319 


RBL2


NM 005611 


21361291 


5934 


D-003299-08 


GAATTTAGATGAGCGGATA 


1320 


RBP1 
















NM 002899 


8400726 


5947 


D-003300-05 


GAGACAAGCTCCAGTGTGT 


1321 


RBP1


NM 002899 


8400726 


5947 


D-003300-06 


GCAAGCAAGTATTCAAGAA 


1322 


RBP1


NM 002899 


8400726 


5947 


D-003300-07 


GCAGGACGGTGACCATATG


1323 




RBP1


NM_002899


8400726 


5947 


D-003300-08 


GCAAGTGCATGACAACAGT 


1324 


RPA3














RPA3


NM_002947


19923751 


6119 


D-003322-05 


GGAAGTGGTTGGAAGAGTA 


1325 


RPA3


NM_002947


19923751 


6119 


D-003322-06 


GAAGATAGCCATCCTTTTG


1326 


RPA3


NM_002947


19923751 


6119 


D-003322-07 


CATGCTAGCTCAATTCATC 


1327 


RPA3


NM 002947 


19923751 


6119 


D-003322-08 


GATCTTGGACTTTACAATG


1328 


SKP1A 














SKP1A 


NM 006930 


25777710 


6500 


D-003323-05 


GGAGAGATATTTGAAGTTG 


1329 


SKP1A 


NM 006930 


25777710


6500 


D-003323-06 


GGGAATGGATGATGAAGGA


1330 


SKP1A 


NM 006930 


25777710 


6500 


D-003323-07 


CAAACAATCTGTGACTATT 


1331 


SKP1A 


NM 006930 


25777710 


6500 


D-003323-08 


TCAATTAAGTTGCAGAGTT 


1332 


SKP2 
SKP2


NM 005983 


16306594


6502


D-003324-05


UA I U I AoAOTTAAGTGATA 


1333 


SKP2 


NM 005983 


16306594


6502


D-003324-06


UAAA I UACjATCTCTCTACT 


1334 


SKP2 


NM 005983 


16306594


6502


D-003324-07


O I AAACjvjTCTCTGGTG I I I 


1335 


SKP2 


NM 005983 


16306594


6502


D-003324-08


bA 1 oCj 1 AOCCTTCAACTGT 


1336 


SNK 














SNK 


NM 006622 


5730054 


10769


D-003325-05


bAAbAOA ICTACAAGCTTA 


1337 


SNK 


NM 006622 


5730054 


10769


D-003325-06


oAAA 1 AOGTTCATGAACAA 


1338 


SNK 


NM 006622 


5730054 


10769


D-003325-07


oAAvjCj I OAATGGCTCATAT 


1339 


SNK 


NM 006622 


5730054 


10769


D-003325-08


UUUCjAGATCTCGCGGATTA 


1340 


STK12 














STK12 


NM 004217 


4759177


9212


D-003326-07


UACjAAGAGCTGCACA 1 1 IG 


1341 


STK12 


NM 004217 


4759177 


9212


D-003326-08


AAACTG CTC AG G C ATAA 


1342 


STK12 


NM 004217 


4759177


9212


D-003326-09


AGGCGGCACTTCACAATTG 


1343 


STK12 


NM 004217 


4759177 


9212


D-003326-10


TGGGACACCCGACATCTTA


1344 


TFDP1 














TFDP1 


NM 007111 


34147667


7027


D-003327-05


GGAAGCAGCTCTTGCCAAA


1345 


TFDP1 


NM 007111 


34147667 


7027


D-003327-06


GAGGAGACTTGAAAGAATA 


1346 


TFDP1 


NM 007111 


34147667 


7027


D-003327-07


GAACTTAGAGGTGGAAAGA 


1347 


TFDP1 


NM 007111 


34147667 


7027


D-003327-08


GCGAGAAGGTGCAGAGGAA 


1348 


TFDP2 














TFDP2 


NM 006286 


5454111


7029


D-003328-05


GAAAGTGTGTGAGAAAGTT 


1349 


TFDP2 


NM 006286 


5454111


7029


D-003328-06


OAOAGGACCTTCTTGGTTA 


1350 


TFDP2 


NM 006286 


5454111


7029


D-003328-07


CGAAATCCCTGGTGCCAAA


1351 


TFDP2 


NM_006286


5454111


7029


D-003328-08


TGAGATCCATGATGACATA 


1352 


TP53 














TP53 


NM 000546 


8400737


7157


D-003329-05


GAGGTTGGCTCTGACTGTA 


1353 


TP53 


NM 000546 


8400737


7157


D-003329-06


CAGTCTACCTCCCGCCATA 


1354 


TP53 


NM 000546 


8400737


7157


D-003329-07


GCACAGAGGAAGAGAATCT 


1355 


TP53 


NM 000546 


8400737


7157


D-003329-08


GAAGAAACCACTGGATGGA 


1356 


TP63 














TP63 


NM 003722 


31543817


8626


D-003330-05


CATCATGTCTGGACTATTT


1357 


TP63 


NM 003722 


31543817


8626


D-003330-06


OAAACAAGATTGAGATTAG 


1358 


TP63 


NM_003722


31543817


8626


D-003330-07


GCACACAGACAAATGAATT 


1359 


TP63 


NM 003722 


31543817 


8626


D-003330-08


UGACAGTCTTGTACAATTT 


1360 


TP73 














TP73 


NM 005427 


4885644 


7161


D-003331-05


GCAAGCAGCCCATCAAGGA


1361 


TP73 


NM_005427


4885644


7161


D-003331-06


GAGACGAGGACACGTACTA


1362 


TP73 


NM_005427


4885644


7161


D-003331-07


CTGCAGAAGCTGACCATTG


1363 


TP73 


NM 005427 


4885644 


7161 


D-003331-08 


GGCCATGCCTGTTTACAAG


1364 


YWHAZ 














YWHAZ 


NM 003406 


21735623 


7534 


D-003332-05 


GCAAGGAGCTGAATTATCC 


1365 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-06 


TAAGAGATATCTGCAATGA 


1366 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-07 


GACGGAAGGTGCTGAGAAA 


1367 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-08 


AGAGCAAAGTCTTCTATTT 


1368 



Table IX 



Gene Name


Accession # 


GI#


Duplex # 


Sequence 


SEQ. ID NO.


AR 


NM 000044 


21322251 


D-003400-01 


GGAACTCGATCGTATCATT 


1369 


AR 


NM 000044 


21322251 


D-003400-02 


CAAGGGAGGTTACACCAAA 


1370 


AR 


NM_000044 


21322251 


D-003400-03 


TCAAGGAACTCGATCGTAT 


1371 


AR 


NM 000044 


21322251 


D-003400-04 


GAAATGATTGCACTATTGA 


1372














ESR1 


NM 000125 


4503602 


D-003401-01 


GAATGTGCCTGGCTAGAGA 


1373 


ESR1 


NM 000125 


4503602 


D-003401-02 


CATGAGAGCTGCCAACCTT 


1374 


ESR1 


NM_000125 


4503602 


D-003401-03 


AGAGAAAGATTGGCCAGTA 


1375 
ESR1


NM_000125


4503602


D-003401-04


:aaggagactcgctactgt 


1376















ESR2


NM_001437


10835012


D-003402-01


GAACATCTGCTCAACATGA


1377 




ESR2


NM_001437


10835012


D-003402-02


GCACGGCTCCATATACATA


1378 


ESR2


NM_001437


10835012


D-003402-03


DAAGAAGATTCCCGGCTTT 


1379 


ESR2


NM_001437


10835012


D-003402-04


GGAAATGCGTAGAAGGAAT


1380 














ESRRA


NM_004451


18860919 


D-003403-01 


GGCCTTCGCTGAGGACTTA


1381 




ESRRA


NM_004451


18860919


D-003403-02


TGAATGCACTGGTGTCTCA 


1382 


ESRRA


NM_004451


18860919 


D-003403-03 


GCATTGAGCCTCTCTACAT


1383 


ESRRA


NM_004451


18860919 


D-003403-04 


CCAGACAGCGGGCAAAGTG


1384 














ESRRB


NM_004452


22035686 


D-003404-01 


TACCTGAGCTTACAAATTT 


1385 


ESRRB


NM_004452


22035686 


D-003404-02 


GCACTTCTATAGCGTCAAA 


1386 


ESRRB


NM_004452


22035686


D-003404-03 


CAACTCCGATTCCATGTAC 


1387 


ESRRB


NM_004452


22035686 


D-003404-04 


GGACTCGCCACCCATGTTT


1388 














ESRRG


NM_001438


4503604 


D-003405-01 


AAACAAAGATCGACACATT 


1389 


ESRRG


NM_001438


4503604 


D-003405-02 


TCAGGAAACTGTATGATGA 


1390 


ESRRG


NM_001438


4503604 


D-003405-03 


GAAGACCAGTCCAAATTAG 


1391 


ESRRG


NM_001438


4503604 


D-003405-04 


ATGAAGCGCTGCAGGATTA 


1392 














HNF4A


NM_000457


21361184


D-003406-01 


CGACATCACTGGAGCATAT 


1393 


HNF4A


NM_000457


21361184


D-003406-02 


GAAGGAAGCCGTCCAGAAT 


1394 


HNF4A


NM_000457


21361184


D-003406-03 


CCAAGTACATCCCAGCTTT 


1395 


HNF4A


NM_000457


21361184


D-003406-04 


GGACATGGCCGACTACAGT 


1396 














HNF4G


NM_004133


6631087


D-003407-01


GCACTGACATAAACGTTAA


1397 


HNF4G


NM_004133


6631087


D-003407-02 


ACAAAGAGATCCATGATGT 


1398 




HNF4G


NM_004133


6631087


D-003407-03 


AGAGATCCATGATGTATAA 


1399 


HNF4G


NM_004133


6631087


D-003407-04 


AAATGAACGTGACAGAATA 


1400 














UQA \OAOR 


MM 017^39 


8923776


D-003408-01 


GAATGAATCTACACCTTTG 


1401 


UQA \OAOR 


NM 01 7^39 


8923776


D-003408-02 


GGAAATACGTGGAGACACT 


1402 


UQA 


MM 017539 



8923776 


D-003408-03 


CCAGATAACTACGGCGATA 


1403 


UQA \OAOR 


MM 017539 



8923776 


D-003408-04 


TGGCGTACCTTCTCATTGA 


1404 














NR0B1


NM_000475


5016089


D-003409-01 


C AG C ATG G ATG AT AXG ATG 


1405 


NR0B1


NM_000475


5016089 


D-003409-02 


CTGCTGAGATTCATCAATG 


1406 


NR0B1


NM_000475


5016089


D-003409-03 


ACAGATTCATCGAACTTAA 


1407 


NR0B1


NM_000475


5016089 


D-003409-04 


GAACGTGGCGCTCCTGTAC 


1408 














NR0B2


NM_021969


13259502


D-003410-01


GAATATGCCTGCCTGAAAG 


1409 




NR0B2


NM_021969


13259502


D-003410-02


GGAATATGCCTGCCTGAAA 


1410 


NR0B2


NM_021969


13259502


D-003410-03


CGTAGCCGCTGCCTATGTA


1411 


NR0B2


NM_021969


13259502


D-003410-04


GCCATTCTCTACGCACTTC


1412 














NR1D1


NM_021724


13430847


D-003411-01


CAACACAGGTGGCGTCATCTT 


1413 


NR1D1


NM_021724


13430847


D-003411-02


GGCATGGTGTTACTGTGTATT


1414 


NR1D1


NM_021724


13430847


D-003411-03


CAACATGCATTCCGAGAAGTT 


1415 


NR1D1 


NM 021724 


13430847 


D-003411-04


GCGCTTTGCTTCGTTGTTCTT 


1416 














NR1H2


NM 007121 


1132162G 


D-003412-01


GAACAGATCCGGAAGAAGA 


1417 


NR1H2 


NM 007121 


1132162^ 


D-003412-02


GAAGAACAGATCCGGAAGA


1418 


NR1H2 


NM 007121 


1132162$ 


D-003412-03


CTAAGCAAGTGCCTGGTTT


1419 


NR1H2 


NM 007121 


1132162< 


D-003412-04


GCTAACAGCGGCTCAAGAA


1420 






WO 2004/045543 
NR1H3


NM_005693




^n^i RQ9 
ouo i oyz 


D-003413-01


GAACAGATCCGCCTGAAGA 


1421 


NR1H3 


NM_005693


kc\oa &Q9 
ouo i oyz 


D-003413-02


GGAGATAGTTGACTTTGCT


1422 


NR1H3 


NM_005693


^0^1 AQ9 

ouo i oyz 


D-003413-03


GAGTTTGCCTTGCTCATTG


1423 


NR1H3 


NM_005693


cn^l AQ9 

ouo 1 oyz 


D-003413-04


TGAUI 1 1 GU I AAACAGCTA 


1424 














NR1H4


NM_005123


H-ozoy / y 


D-003414-01


CAAGTGACCTCGACAACAA 


1425 


NR1H4


NM_005123


4ozoy / y 


D-003414-02


GAAAGAATTCGAAATAGTG 


1426 


NR1H4 


NM_005123


*+ozoy / y 


D-003414-03


CAACAGACTCTTCTACATT 


1427 


NR1H4


NM_005123


Hozoy/y 


D-003414-04


GAACCATACTCGCAATACA 


1428 














NR1I2


NM_003889


1 1 OUO 1 OO 


D-003415-01


GAACCATGCTGACTTTGTA 


1429 


NR1I2


NM_003889


1 1 ODO 1 OO 


D-003415-02


GATGGACGCTCAGATGAAA 


1430 


NR1I2


NM_003889


1 1 ODO 1 OO 


D-003415-03


CAACCTACATGTTCAAAGG 


1431 


NR1I2


NM_003889


I I OOO I OO 


D-003415-04


CAGGAGCAATTCGCCATTA 


1432 














NR1I3 


NM_005122


40ZDODU 


D-003416-01


GGAAATCTGTCACATCGTA 


1433 


NR1I3


NM_005122


4OZ00DU 


D-003416-02


TCGCAGACATCAACACTTT 


1434 


NR1I3 


NM_005122


^OZDDOU 


D-003416-03


CCTCTTCGCTACACAATTG 


1435 


NR1I3 


NM_005122


^OZODOU 


D-003416-04


GAACAGTTTGTGCAGTTTA 


1436 














NR2C1 


NM_003297


4oU ( D/ z 


D-003417-01


TGACAGCACTTGATCATAA


1437 


NR2C1 


NM_003297


40U/0/Z 


D-003417-02


GGAAGGAAGTGTACACCTA 


1438 


NR2C1


NM_003297


A CA7R70 
40U/0/Z 


D-003417-03 


GAGCACATCTTCAAACTAC


1439 


NR2C1 


NM_003297


4ouYo/z 


D-003417-04


GAAGAAATTGCACATCAAA 


1440 














NR2C2


NM_003298


4ou/D/4 


D-003418-01 


GAACAACGGTGACACTTCA


1441 


NR2C2 


NM_003298


40U/D/4 


D-003418-02


CTGATGAGCTCCAACATAA 


1442 


NR2C2 


NM_003298


yl Kn7«VyI 

4DU/D/ 4 


D-003418-03


CAACCTAAGTGAATCTTTG 


1443 


NR2C2


NM_003298


40U/ O/ 4 


D-003418-04 


GAAGACACCTACCGATTGG 


1444 














NR2E1 


NM_003269


21361108


D-003419-01


GATCATATCTGAAATACAG 


1445 


NR2E1 


NM_003269


21361108


D-003419-02


CAAGACTGCTTTCAGATAT 


1446 


NR2E1 


NM 003269 


21361108 


D-003419-03 


GTTAGATGCTACTGAATTT 


1447 


NR2E1 


NM 003269 


21361108 


D-003419-04


CAATGTATCTCTATGAAGT 


1448 














NR2E3 


NM_014249


/DO/ oy4 


D-003420-01 


GAGAAGCTCCTTTGTGATA 


1449 


NR2E3 


NM_014249


/ DO/ oy4 


D-003420-02


G AAG C ACT ATG G CATCXAX 


1450 


NR2E3


NM_014249


/ DO/ 0«4 
7«cyoQ / i 


D-003420-03


GAAGGATCCTGAGCACGTA 


1451 




NR2E3


NM_014249


/do/ oy4 


D-003420-04


GAAGCTCCTTTGTGATATG


1452 


NR2F1 


NM_005654


9H1 97/lft/l 
ZU I Z / 4o4 


D-003421-01


GAAACTCTCATCCGCGATA 


1453 


NR2F1 


NM_005654


901 974A4 
ZU I Z / 4o4 


D-003421-02


TCTCATCCGCGATATGTTA 


1454 


NR2F1 


NM_005654


901 97/lp/l 
ZU 1 Z / *K54 


D-003421-03


CAAGAAGTGCCTCAAAGTG 


1455 


NR2F1 


NM_005654


901 

ZU I Z/ 4o4 


D-003421-04


GGAACTTAACTTACACATG 


1456 














NR2F2 


NM_021005


1 A*\ AOTA^ 
I | £+y / 0 


D-003422-01


GTACCTGTCCGGATATATT 


1457 




NR2F2


NM_021005


H yl A A CV7A CT 

I 4 l4y/4o 


D-003422-02


CCAACCAGCCGACGAGATT 


1458 


NR2F2 


NM_021005


I 1 4y / 4o 


D-003422-03


ACTCGTACCTGTCCGGATA 


1459 


NR2F2 


NM_021005


I h- I 4y / 4o 


D-003422-04


GGCCGTATATGGCAATTCA 


1460 














NR2F6 


NM 005234 


20070198 


D-003423-01 


CGACGCCTGTGGCCTCTCA


1461 


NR2F6 


NM 005234 


20070198 


D-003423-02 


CAGCCGGTGTCCGAACTGA 


1462 


NR2F6 


NM 005234 


20070198 


D-003423-03 


CAACCGTGACTGCCAGATC 


1463 


NR2F6 


NM 005234 


20070198 


D-003423-04 


GTACTGCCGTCTCAAGAAG 


1464 
NR3C1


NM 000176 


4504132 


D-003424-01 


GAGGACAGATGTACCACTA


1465 


NR3C1 


NM 000176 


4504132 


D-003424-02 


GATAAGACCATGAGTATTG 


1466 


NR3C1 


NM 000176 


4504132 


D-003424-03 


GAAGACGATTCATTCCTTT 


1467 


NR3C1 


NM 000176 


4504132 


D-003424-04 


GGACAGATGTACCACTATG 


1468 














NR3C2 


NM 000901 


4505198 


D-003425-01 


GCAAACAGATGATCCAAGT 


1469 


NR3C2 


NM 000901 


4505198 


D-003425-02 


CAGCTAAGATTTATCAGAA


1470 


NR3C2 


NM 000901 


4505198 


D-003425-03 


GCACGAAAGTCAAAGAAGT 


1471 


NR3C2 


NM 000901 


4505198 


D-003425-04 


GGTATCCGGTCTTAGAATA 


1472 














NR4A1 


NM 002135 


21361341 


D-003426-01 


GAAGGAAGTTGTCCGAACA


1473 


NR4A1 


NM 002135 


21361341 


D-003426-02 


CAGGAGAGTTTGACACCTT 


1474 


NR4A1 


NM 002135 


21361341 


D-003426-03 


CAGTGGCTCTGACTACTAT 


1475 


NR4A1 


NM 002135 


21361341 


D-003426-04 


GAAGGCCGCTGTGCTGTGT 


1476 














NR4A2 


NM 006186 


5453821 


D-003427-01


GCAATGCGTTCGTGGCTTT 


1477 


NR4A2 


NM 006186 


5453821 


D-003427-02 


CGGCTACACAGGAGAGTTT 


1478 


NR4A2 


NM 006186 


5453821 


D-003427-03 


CCACGTGACTTTCAACAAT 


1479 


NR4A2 


NM 006186 


5453821 


D-003427-04 


GAATACAGCTCCGATTTCT 


1480 














NR4A3 


NM 006981 


11276070 


D-003428-01 


CAAAGAAGATCAGACATTA 


1481 


NR4A3 


NM 006981 


11276070 


D-003428-02 


GATCAGACATTACTTATTG 


1482 


NR4A3 


NM 006981 


11276070 


D-003428-03 


CCAGAGATCTTGATTATTC 


1483 


NR4A3 


NM 006981 


11276070 


D-003428-04 


GAAGTTGTCCGTACAGATA 


1484 














NR5A1 


NM 004959 


20070192 


D-003429-01 


GATTTGAAGTTCCTGAATA 


1485 


NR5A1 


NM 004959 


20070192 


D-003429-02 


GGAGCGAGCTGCTGGTGTT 


1486 


NR5A1 


NM 004959 


20070192 


D-003429-03 


GGAGGTGGCCGACCAGATG 


1487 


NR5A1 


NM 004959 


20070192 


D-003429-04 


CAACGTGCCTGAGCTCATC 


1488 














NR5A2 


NM 003822 


20070161 


D-003430-01 


CCAAACATATGGCCACTTT 


1489 


NR5A2 


NM 003822 


20070161 


D-003430-02


TCAGAGAACTTAAGGTTGA 


1490 


NR5A2 


NM 003822 


20070161


D-003430-03


GGATCCATCTTCCTGGTTA 


1491


NR5A2 


NM 003822 


20070161 


D-003430-04 


AAGAATACCTCTACTACAA 


1492 














NR6A1 


NM 033334 


15451847 


D-003431-01 


CAACGAACCTGTCTCATTT 


1493 


NR6A1 


NM 033334 


15451847 


D-003431-02 


GAAGAACTACACAGATTTA 


1494 


NR6A1 


NM 033334 


15451847 


D-003431-03 


G AAG ATGG ATACG CTGJGA 


1495 


NR6A1 


NM 033334 


15451847 


D-003431-04 


AAACGATACTGGTACATTT


1496 














null 


D16815 


2116671 


D-003432-01 


GAAGAATGATCGAATAGAT 


1497 


null 


D16815 


2116671 


D-003432-02 


GAACATGGAGCAATATAAT 


1498 


null 


D16815 


2116671 


D-003432-03 


GAGGAGCTCTTGGCCTTTA 


1499 


null 


D16815 


2116671 


D-003432-04 


TAAACAACATGCACTCTGA 


1500 














PGR 


NM 000926 


4505766 


D-003433-01 


GAGATGAGGTCAAGCTACA 


1501 


PGR 


NM 000926 


4505766 


D-003433-02 


CAGCGTTTCTATCAACTTA


1502 


PGR 


NM 000926 


4505766 


D-003433-03 


AGATAACTCTCATTCAGTA 


1503 


PGR 


NM 000926 


4505766 


D-003433-04 


GTAGTCAAGTGGTCTAAAT


1504 














PPARA


NM_005036


7549810


D-003434-01 


TCACGGAGCTCACGGAATT 


1505 


PPARA 


NM_005036


7549810 


D-003434-02 


GAACATGACATAGAAGATT 


1506 


PPARA 


NM 005036 


7549810 


D-003434-03 


GGATAGTTCTGGAAGCTTT 


1507 


PPARA 


NM 005036 


7549810 


D-003434-04 


GACTCAAGCTGGTGTATGA 


1508 














PPARD 


NM 006238 


5453939 


D-003435-01 


GAGCGCAGCTGCAAGATTC 


1509 
PPARD


NM_006238


5453939 


D-003435-02 


GCATGAAGCTGGAGTACGA


1510 


PPARD


NM 006238 


5453939 


D-003435-03 


GGAAGCAGTTGGTGAATGG 


1511 


PPARD


NM_006238


5453939 


D-003435-04 


GCTGCAAGATTCAGAAGAA 


1512 














PPARG


NM_138712


20336234 


D-003436-01 


AGACTCAGCTCTACAATAA 


1513 


PPARG


NM_138712


20336234 


D-003436-02 


GATTGAAGCTTATCTATGA 


1514 


PPARG


NM_138712


20336234 


D-003436-03 


AAGTAACTCTCCTCAAATA 


1515 


PPARG


NM_138712


20336234 


D-003436-04 


GCATTTCTACTCCACATTA 


1516 














RARA


NM 000964 


4506418 


D-003437-01 


GACAAGAACTGCATCATCA 


1517 


RARA


NM 000964 


4506418 


D-003437-02 


GCAAATACACTACGAACAA 


1518 


RARA


NM_000964


4506418


D-003437-03


GAACAACAGCTCAGAACAA


1519 


RARA


NM_000964


4506418


D-003437-04 


GAGCAGCAGTTCTGAAGAG 


1520 














RARB


NM_000965


14916493


D-003438-01 


GCACACTGCTCAATCAATT 


1521 


RARB


NM 000965 


14916493 


D-003438-02 


GCAGAAGTATTCAGAAGAA 


1522 


RARB


NM_000965


14916493 


D-003438-03 


GGAATGACAGGAACAAGAA 


1523 


RARB


NM_000965


14916493 


D-003438-04 


GCACAGTCCTAGCATCTCA 


1524 














RARG


NM_000966


21359851 


D-003439-01 


GAAATGACCGGAACAAGAA 


1525 


RARG


NM 000966 


21359851 


D-003439-02 


TAGAAGAGCTCATCACCAA 


1526 


RARG


NM_000966


21359851 


D-003439-03 


CAAGGAAGCTGTGCGAAAT 


1527 


RARG


NM 000966 


21359851 


D-003439-04 


TCAGTGAGCTGGCTACCAA 


1528 














RORA


NM_134261


19743902


D-003440-01 


GGAAAGAGTTTATGTTCTA 


1529 


RORA


NM_134261


19743902


D-003440-02


CAAGATCTGTGGAGACAAA


1530 


RORA 


NM 134261 


19743902


D-003440-03


GCACCTGACTGAAGATGAA


1531 


RORA


NM_134261


19743902


D-003440-04 


CCGAGAAGATGGAATACTA 


1532 














RORB


NM_006914


19743906


D-003441-01 


GCACAGAACATCATTAAGT 


1533 


RORB 


NM 006914 


19743906


D-003441-02 


CCACACCTATGAAGAAATT 


1534 


RORB


NM_006914


19743906


D-003441-03


GATCAAATTCTACTTCTGA 


1535 


RORB


NM_006914


19743906


D-003441-04


TCAAACAGATAAAGCAAGA


1536 














RORC


NM_005060


19743908


D-003442-01 


TAGAACAGCTGCAGTACAA 


1537 


RORC


NM_005060


19743908


D-003442-02 


TCACCGAGGCCATTCAGTA 


1538 


RORC 


NM 005060 


19743908


D-003442-03 


GAACAGCTGCAGTACAATC 


1539 


RORC


NM_005060


19743908


D-003442-04


CCTCATGCCACCTTGAATA


1540














RXRA


NM 002957 


21536318 


D-003443-01 


TGACGGAGCTTGTGTCCAA 


1541 


RXRA


NM 002957 


21536318 


D-003443-02 


CAACAAGGACTGCCTGATT 


1542 


RXRA


NM 002957 


21536318 


D-003443-03 


GCAAGGACCTGACCTACAC 


1543 


RXRA


NM_002957


21536318 


D-003443-04 


GCAAGGACCGGAACGAGAA 


1544 














RXRB


NM 021976 


21687229 


D-003444-01 


GCAAAGACCTTACATACTC


1545 


RXRB


NM 021976 


21687229 


D-003444-02 


GCAATCATTCTGTTTAATC 


1546 


RXRB


NM_021976


21687229 


D-003444-03 


TCACACCGATCCATTGATG 


1547


RXRB


NM_021976


21687229 


D-003444-04 


GCAAACGGCTATGTGCAAT


1548 














RXRG


NM_006917


21361386 


D-003445-01 


GGAAGGACCTCATCTACAC 


1549 


RXRG 


NM 006917 




21361386


D-003445-02




1550 


RXRG 


NM 006917 


21361386 


D-003445-03 


GCGAGCCATTGTACTCTTT 


1551 


RXRG 


NM 006917 


21361386 


D-003445-04 


GAGCCATTGTACTCTTTAA


1552 














THRA 


NM 003250 


20127451 


D-003446-01 


GGACAAAGACGAGCAGTGT 


1553 


THRA 


NM 003250 


20127451 


D-003446-02 


GGAAACAGAGGCGGAAATT 


1554 
THRA


NM 003250 


20127451 


D-003446-03 


GTAAGCTGATTGAGCAGAA 


1555 


THRA 


NM 003250 


20127451 


D-003446-04 


GAACCTCCATCCCACCTAT 


1556 














THRB 


NM 000461 


10835122 


D-003447-01 


GAATGTCGCTTTAAGAAAT


1557 


THRB 


NM 000461 


10835122 


D-003447-02 


GAACAGTCGTCGCCACATC 


1558 


THRB 


NM 000461 


10835122 


D-003447-03 


GGACAAGCACCAATAGTCA 


1559 


THRB


NM_000461


10835122


D-003447-04 


GTGGAAAGGTTGACTTGGA 


1560 














VDR 


NM 000376 


4507882 


D-003448-01 


TGAAGAAGCTGAACTTGCA 


1561 


VDR 


NM 000376 


4507882 


D-003448-02 


GCAACCAAGACTACAAGTA


1562 


VDR 


NM 000376 


4507882 


D-003448-03 


TCAATGCTATGACCTGTGA 


1563 


VDR 


NM 000376 


4507882 


D-003448-04 


CCATTGAGGTCATCATGTT 


1564 



Table X 



Gene Symbol


Sense


SEQ ID NO.






ABCB1 


GACCAUAAAUGUAAGGUUU 


1565 




UAGAAGAUCUGAUGUCAAA 


1566 




GAAAUGUUCACUUCAGUUA 


1567 




GAAGAUCGCUACUGAAGCA 


1568 



ABCC1 



ABCG2 



KCNH2 



KCNH1 



CLCA1 



Sense




GGAAGCAACUGCAGAGACA


1569 


GAUGACACCUCUCAACAAA 


1570 


UAAAGUUGCUCAUCAAGUU 


1571 


CAACGAGUCUGCCGAAGGA


1572 




Sense




GCAGAUGCCUUCUUCGUUA


1573 


AGGCAAAUCUUCGUUAUUA


1574 


GGGAAGAAAUCUGGUCUAA 


1575 


UGACUCAUCCCAACAUUUA 


1576 




Sense




CCGACGUGCUGCCUGAGUA 


1577 


GAGAAGAGCAGCGACACUU 


1578 


GAUCAUAGCACCUAAGAUA 


1579 


GCUAUUUACUGCUCUUAUU 


1580 


UCACUGGGCUCCUUUAAUU 


1581 


GUGCGAGCCUUCUGAAUAU 


1582 


GCUAAGCUAUACUACUGUA 


1583 


UGACGGCGCUCUACUUCAC 


1584 




Sense




GAGAUGAAUUCCUUUGAAA 


1585 


GAAGAACGCAUGAAACGAA


1586 


GAUAAAGACACGAUUGAAA


1587 


GCUGAGAGGUCUAUUUAAA 


1588




Sense 




GAACAACAAUGGCUAUGAA 


1589 


GUACAUACCUGGCUGGAUU


1590 
SLC6A1



SLC6A2 



SLC21A2 



SLC21A3 



GAACAGCUCACAAGUAUAU 


1591 


GGAAACGUGUGUCUAUAUU 


1592 




Sense 




GGAGGUGGGAGGACAGUUA


1593 


UCACAGCCCUGGUGGAUGA


1594 


GAAGCUGGCUCCUAUGUUC


1595


GGUCAACACUACCAACAUG


1596




Sense




GAACACAAGGUCAACAUUG


1597 


AGAAGGAGCUGGCCUAGUG 


1598 


CGGAAACUCUUCACAUUUG 


1599 


CAACAAAUUUGACAACAAC


1600 




Sense




GUACAUCUCCAUCUUAUUU 


1601 


GGAAGUGGCUGAGUUAUUA 


1602 


GAAGGGAGGCUCAAUGUAA


1603 


GAAGGAAGUGGCUGAGUUA


1604 




Sense




GUAGAAACAGGAGCUAUUA 


1605 


CAAGAUUACUGUCAAACAA 


1606 


GCACAAGAGUAUUUGGUAA 


1607 


GCAAAUGUCCCUUCUGUAU 


1608 


GCAUGACUCCUAUAUAAUA 


1609 


AAACAGCAAUUUCCCUUAA 


1610 


GAAAAUGCCUCUUCAGGAA 


1611 



SLC28A1 



SLC29A1 



SLC26A1 



Sense 




GUUCAUCGCUCUCCUCUUU 


1612


GGAUCAAGCUGUUUCUGAA


1613 


GGACUGCAGUUUGUACUUG 


1614 


GAGUGAAACUGACCUAUGG 


1615 




Sense




GAACGCUGCUCCCGUGGAA 


1616 


GAAAGCCACUCUAUCAAAG 


1617 


GAAACCAGGUGCCUUCAGA 


1618 


CCUCACAGCUGUAUUCAUG 


1619 




Sense


CCACGGAGCUGCUGGUCAU 


1620 


GGGUUGACAUCUUAUUUGA 


1621 


GCACGAGGGUCUCUGUGUU 


1622 


GGCCAUCGCCUACUCAUUG 


1623 


CAACACCCAUGGCAAUUAA 


1624 


GAGGAAAGAUCUUGCUGAU 


1625 


GAGCAAGCGUCCUCCAAAU 


1626 


GCAACACCCAUGGCAAUUA


1627 



SLC26A2 


Sense






CCAAAGAACUCAAUGAACA 


1628 




ACAAGAACCUUCAGACUAA 


1629 




GAAGGUAGAUAGAAGAAUG 


1630 
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| GUAUUGAACUGUACUGUAA 



1631 



SLC4A4 



GLRA1 



KLK1 



ADAM2 



XPNPEP1



GZMA 



CMKLR1 



CLN3 



CALCR 



Sense 




GCAAUUCUCUUCAUUUAUC 


1632 


GGAAAGAUGUCCACUGAAA 


1633 


GGACAAAGCCUUCUUCAAU 


1634 


GGAAUGGGAUCCAGCAAUU 


1635 




Sense




UGAAAGCCAUUGACAUUUG


1636 


CAGACACGCUGGAGUUUAA 


1637 


CAAUAGCGCUUUCUGGUUU 


1638 


GCAGGUAGCAGAUGGACUA


1639 




Sense 




UCAGAGUGCUGUCUUAUGU 


1640 


CAACUUGUUUGACGACGAA 


1641 


UGACAGAGCCUGCUGAUAC 


1642 


AGGCGGCUCUGUACCAUUU


1643 




Sense




GAAACAUGCUGUGAUAUUG 


1644 


GCAGAUGUUUCCUUAUAUA 


1645 


CAACAGAGAUGCCAUGAUA 


1646 


GAAAGGCGCUACAUUGAGA 


1647 




Sense




GACCUGAGCUUCCCAACAA 


1648 


GCGACUGGCUCAACAAUUA 


1649 


GAGAUUGCGUGGCUAUUUA 


1650 


GACAGCAACUGGACACUUA 


1651 




Sense




GGAAGAGACUCGUGCAAUG 


1652 


GGAACCAUGUGCCAAGUUG 


1653 


GAAGUAACUCCUCAUUCAA 


1654 


GAACUCCUAUAGAUUUCUG


1655 




Sense 




CAUAGAAGCUUUACCAAGA 


1656 


GAAUGGAGGAUGAAGAUUA 


1657 


GGUCAAUGCUCUAAGUGAA 


1658 


GAGAGGACUUCUAUGAAUG 


1659 




Sense




CAUCAUGCCUUCUGAAUAA 


1660 


CAACAGCUCAUCACGAUUU 


1661 


gcaacaacuucucuOaugu 


1662 


GGUCUUCGCUAGCAUCUCA 


1663 




Sense 




GGACCUAGCUGUUGUAAAG 


1664 


GAAAGACCAUGCAUUUAAA


1665 


GCAGGAAGAUGUAUGCUUU 


1666 


GAAUAAACCAGUAUCGUUA


1667 
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OXTR 



EDG4 



EDG5 



EDG7 



PTCH 



SMO 



CASP3 



CASP6 



CASP7 



Sense 




GGACCCAGAUAUCCAAAUA 


1668 


GCAAUACUAUCCUAACUGA 


1669 


GAAUAUAGAUUAGCGUUUG


1670 


GAUGAGGCAUGACUACUAA 


1671 




Sense 




GCGAGUCUGUCCACUAUAC 


1672 


GAGAACGGCCACCCACUGA


1673 


GAACGGCCACCCACUGAUG 


1674 


GGUCAAUGCUGCUGUGUAC


1675 




Sense




UCCAGGAACACUAUAAUUA 


1676 


GUGACCAUCUUCUCCAUCA 


1677 


CAUCCUCUGUUGCGCCAUU 


1678 


CCAACAAGGUCCAGGAACA 


1679 




Sense




ACACUGAUACUGUCGAUGA 


1680 


AAUAGGAGCAACACUGAUA 


1681 


CAGCAGGAGUUACCUUGUU 


1682 


GGACACCCAUGAAGCUAAU


1683




Sense 




GCACAGAACUCCACUCAAA


1684 


GGACAGCAGUUCAUUGUUA 


1685 


GAGAAGAGGCUAUGUUUAA 


1686 


GGACAAACUUCGACCCUUU 


1687




Sense




UCGCUACCCUGCUGUUAUU 


1688 


GCUACAAGAACUACCGAUA


1689 


CAAGAAAGCUUCCUUCAAC 


1690 


GAGAAGAAAUACAGUCAAU


1691 




Sense




CAAUAUAUCUGAAGAGCUA 


1692 


GAACUGGACUC3UGGCAUUG 


1693 


GUGAGAAGAUGGUAUAUUU


1694 


GAGGGUACUUUAAGACAUA


1695 




Sense




CAUGAGGUGUCAACUGUUA 


1696 


GAAGUGAAAUGCUUUAAUG 


1697 


AAAUAUGGCUCCUCCUUAG 


1698 


GCAAUCACAUUUAUGCAUA


1699 


CAACAUAACUGAGGUGGAU 


1700 


CAUGGUACAUUCAAGAUUU


1701 




Sense




GAACUCUACUUCAGUCAAU 


1702


GGGCAAAUGCAUCAUAAUA 


1703 


CAACAGAGGGAGUUUAAUA


1704 


GAACAAAGCCACUGACUGA 


1705 
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CASP9 
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DVL2 



PTEN 



PDK1 



PDK2 



PPP2CA 



CTNNA1 






GAAGUGAACUAUGAAGUAA 



CAACAAGGAUGACAAGAAA



GGACAAAGUUUACCAAAUG 



GAAUAUAGAGGGCUUAUGA
CAACGACUAUGAAGAAUUC



GAAGUGAGCAGAUCAGAAU

GAGGAAAUCUCCAAAUGCA 



Sense 



CCAGGCAGCUGAUCAUAGA 



UCUCAGGUGUUGCCAAAUA 



GAACAGCUGUAAUCUAUGA 



CCACUGGUCUGUAGGGAUU



1706 



1707 
1708 



1710 



1711 



1712 



1713 



1714 



1715 



1716
1717



Sense




UCGUAAAGCUGUUGAUAUC


1718 


GAGGAGAUCUUUGAUGACA


1719 


GUAAAGCUGUUGAUAUCGA 


1720 


GAUCGUAAAGCUGUUGAUA 


1721 




Sense




AGACGAAGGUGAUUUACCA 


1722 


UGUGAGAGCUACCUAGUCA 


1723 


GAAGAAAUUUCAGAUGACA


1724 


UAAUAGGCAUUUCCUCUUU 


1725 




Sense




GUGAAGAUCUUGACCAAUG 


1726 


GAUCAGCAUACACAAAUUA 


1727 


GAAUGAACCUUCUGCAACA 


1728 


GGCGCUAUGUGUAUUAUUA 


1729 




Sense




GUACAAAGCUGGUAUAUCC 


1730 


GAAAGACUCCCAGUGUAUA


1731 


GGAAGUCCAUCUCAUCGAA 


1732 


CCAAAGACAUGACGACGUU


1733 




Sense




GUAAAGAGGAGACUGAAUG 


1734 


GGUCUGUGAUGGUCCCUAA


1735 


CAAAGAUGCCUACGACAUG 


1736 


GGGCGAUGCCUGAGGGUUA 


1737 




Sense 




UCACACAAGUUUAUGGUUU


1738 


CAACAG CCG UG ACC ACU U 0 


1739 


UAACCAAGCUGCAAUCAUG 


1740 


GAACUUGACGAUACUCUAA


1741 




Sense 




GAAGAGAGGUCGUUCUAAG


1742 


AAGCAGAUGUGCAUGAUUA 


1743 


UCUAAUAACUGCAGUGUUU 


1744 
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HSPCA 



DCTN2 



CTNNA2 
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GUAAAGGGCCCUCUAAUAA 1745 | 


Sense




GAAAGAAUAUGCCCAAGUU 


1746 


GAAGAAGAAUGCCACAAUG 


1747 


GCAGGAAGAUUAUGAUGUG 


1748 


AAAGAAAGCCCAUGUACUA


1749 




Sense 




GGGAAAGAGCUGCAUAUUA 


1750 


GCUUAGAACUCUUUACUGA 


1751 


UAUAAGAGCUUGACCAAUG 


1752 


GCAGAUAUCUCUAUGAUUG 


1753 




Sense




CAACUCAUGUCCAAUACUG 


1754 


GGAAUGAGCCAGAUGUUUA 


1755 


GGAGACAGCUGUACGUUGU 


1756


IjCCAAGAGCUGACAACUGA 


1757 



CD2 



Sense 




GUAAGGAGAAGCAAUAUAA 


1758 


AAGAUGAGCUUUCCAUGUA 


1759 


GGACAUCUAUCUCAUCAUU 


1760 


GACAAGAGCCCACAGAGUA 


1761 



BAD 



Sense




GUACUUCCCUCAGGCCUAU 


1762 


GCUGUGCCUUGACUACGUA 


1763 


GUACUUCCCUCAGGCCUAU 


1764 


GGUCAGGUGCCUCGAGAUC 


1765 



SMAC 



MAP3K5 



PVR 



ERBB2 



Sense 




CAGCGUAACUUCAUUCUUC


1766 


UAACUUCAUUCUUCAGGUA 


1767 


CAGCUGCUCUUACCCAUUU 


1768 


GAUUGAAGCUAUUACUGAA 


1769 


UAGAAGAGCUCCGUCAGAA 


1770


CCACAUAUGCGUUGAUUGA


1771 


GCGCAGGGCUCUCUACCUA 


1772 




Sense




GAACAGCCUUCAAAUCAAA


1773 


GAUGUUCUCUACUAUGUUA 


1774 


GCAAAUACUGGAAGGAUUA 


1775 


CAGGAAAGCUCGUAAUUUA 


1776 




Sense 




CCACACGGCUGACCUCAUA 


1777 


CAGCAGAAUUCCUCUUAUA


1778 


GCAGAAUUCCUCUUAUAAA 


1779 


GAUCGGGAUUUAUUUCUAU 


1780 




Sense 




UGUGGGAGCUGAUGACUUU 


1781 


UCACAGAGAUCUUGAAAGG 


1782 
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SOS1 



BRCA1 



CDKN1A 



CDKN1B 



SLC2A4 



NOS2A 



FRAP1 



FKBP1A 



UGGAAGAGAUCACAGGUUA 


1783 


GCUCAUCGCUCACAACCAA 


1784 




Sense




GAGCACCACUUCUAUGAUU 


1785 


CAAAGAAGCUGUUCAAUAU 


1786 


UGAAAGCCCUCCCUUAUUA 


1787 


GAAAUAGCAUGGAGAAGGA


1788 




Sense




CCAUACAGCUUCAUAAAUA 


1789 


GAAGAGAACUUAUCUAGUG 


1790 


GAAGUGGGCUCCAGUAUUA 


1791 


GCAAGAUGCUGAUUCAUUA 


1792 


GAAGUGGGCUCCAGUAUUA 


1793 


GAACGGACACUGAAAUAUU 


1794 


GCAGAUAGUUCUACCAGUA 


1795 




Sense




GAACAAGGAGUCAGACAUU 


1796 


AAACUAGGCGGUUGAAUGA


1797


GAUGGAACUUCGACUUUGU 


1798 


GUAAACAGAUGGCACUUUG 


1799 




Sense




GGAAUGGACAUCCUGUAUA 


1800 


GGAGAAAGAUGUCAAACGU


1801 


GAAUGGACAUCCUGUAUAA


1802 


GUAAACAGCUCGAAUUAAG


1803 




Sense




CAGAUAGGCUCCGAAGAUG


1804 


AGACUCAGCUCCAGAAUAC 


1805


GAUCGGUUCUUUCAUCUUC 


1806 


CAGGAUCGGUUCUUUCAUC


1807




Sense 




CCAGAUAAGUGACAUAAGU


1808 


UAAGUGACCUGCUUUGUAA 


1809 


GAAGAGAGAUUCCAUUGAA


1810 


UGAAAGAGCUCAACAACAA 


1811 




Sense




GAGCAUGCCGUCAAUAAUA


1812 


CAAGAGAACUCAUCAUAAG 


1813 


CCAAAGUGCUGCAGUACUA 


1814 


UAAGAAAGCUAUCCAGAUU


1815 




Sense 




GAAACAAGCCCUUUAAGUU


1816 


GAAUUACUCUCCAAGUUGA 


1817 


CAGCACAAGUGGUAGGUUA 


1818 


GUUGAGGACUGAAUUACUC 


1819 


GAUGGCAGCUGUUUAAAUG 


1820 


GAGUAUCCUUUCAGUGUUA 


1821 
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TNFRSF 
1A 



IL1R1 



IRAK1



TRAF2 



TRAF6 



Sense




CAAAGGAACCUACUUGUAC 


1822 


GGAACCUACUUGUACAAUG 


1823 


GAACCUACUUGUACAAUGA 


1824 


GAGUGUGUCUCCUGUAGUA 


1825 




Sense




GGACAAGAAUCAAUGGAUA 


1826 


GAACAAGCCUCCAGGAUUC 


1827 


GGACUUGUGUGCCCUUAUA 


1828 


GAACACAAAGGCACUAUAA 


1829




Sense




CGAAGAAAGUGAUGAAUUU 


1830 


GCUCUUUGCCCAUCUCUUU 


1831 


UGAAAGACCUGGUGGAAGA 


1832 


GCAAUUCAGUUUCUACAUC 


1833 




Sense




GAAGACAGAGUUAUUAAAC


1834 


UCACGAAGACAGAGUUAUU


1835 


AGACAGAGUUAUUAAACCA


1836 


CACGAAGACAGAGUUAUUA


1837 


GCUGAAGCCUGUCUGAUGU 


1838 




Sense 




CAAAUGAUCUGAGGCAGUU 


1839 


GUUCAUAGUUUGAGCGUUA 


1840 


GGAGAAACCUGUUGUGAUU 


1841 


GGACAAAGUUGCUGAAAUC 


1842 


CAAAUGAUCUGAGGCAGUU 


1843 


GGAGAAACCUGUUGUGAUU 


1844 


GGACAAAGUUGCUGAAAUC 


1845 


GUUCAUAGUUUGAGCGUUA 


1846 



TRADD 



Sense




UGAAGCACCUUGAUCUUUG 


1847


GGGCAGCGCAUACCUGUUU 


1848 


GAGGAGCGCUGUUUGAGUU 


1849 


GGACGAGGAGCGCUGUUUG 


1850 


GAGGAGCGCUGUUUGAGUU 


1851 


GGAUGUCUCUCUCCUCUUU


1852 


GCUCACUCCUUUCUACUAA 


1853 


UGAAGCACCUUGAUCUUUG 


1854 



FADD 



Sense




GCACAGAUAUUUCCAUUUC 


1855 


GCAGUCCUCUUAUUCCUAA 


1856 


GAACUCAAGCUGCGUUUAU 


1857 


GGACGAAUUGAGAUAAUAU 


1858 



IKBKE 


Sense 






UAAGAACACUGCUCAUGAA 


1859 




GAGGCAUCCUGAAGCAUUA 


1860 




GAAGGCGGCUGCAGAACUG 


1861 
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I KB KG 



TNFRSF5



RELA 



ARHA 



CDC42 



ROCK1



PAK1 



PAK2 



GGAACAAGGAGAUCAUGUA 


1862




Sense 


CUAUCGAGGUCGUUAAAUU 1863 


GAAUGCAGCUGGAAGAUCU 1864 


GCGGCGAGCUGGACUGUUU 1865 


CCAGACCGAUGUGUAUUUA 1866




Sense




GGUCUCACCUCGCUAUGGU 


1867 


G/^AGCGAAUUCCUAGACA 


1868 


GCACAAACAAGACUGAUGU


1869 


GAAGGGCACCUCAGAAACA 


1870 


UCUCCCAACUUGUAUUAAA 


1871 




Sense




UCAAGUGUCUUCCAUCAUG 


1872 


UCAAGUGCCUUAAUAGUAG


1873 


GGAGUACCCUGAGGCUAUA


1874 


GAUGAGAUCUUCCUACUGU


1875 




Sense




GAGCUGGGCUAAGUAAAUA 


1876 


GACCAAAGAUGGAGUGAGA


1877 


GGAAGAAACUGGUGAUUGU 


1878 


GGCUGUAACUACUUUAUAA 


1879 




Sense


GGACAUUUGUUUGCCAUUU 


1880 


GGAGAACCAUAUACUCUUG 


1881 


GAACCAAUGCUUUCUCAUG 


1882 


GAAGACCUGUUAUGUAGAG 


1883 


GAUCAAGAAUUGCAAUAUC 


1884 


GAAAAGGGGUGACCUAGUA 


1885 


UGACAAACCUUAUGGAAAA 


1886 




Sense




GGAAUGAGCUUCAGAUGCA 


1887 


GGACACAGCUGUAAGAUUG


1888 


GACAAGAGAUUACAGAUAA


1889 


GAAGAAACAUUCCCUAUUC 


1890




Sense




GAGGGUGGUUUAUGAUUAA


1891 


CAACAAAGAACAAUCACUA 


1892 


GAAGAAAUAUACACGGUUU


1893 


UACAUGAGCUUUACAGAUA


1894 




Sense 




GGUAGGAGAUGAAUUGUUU 


1895 


AGAAGGAACUGAUCAUUAA 


1896 


CUACAGACCUCCAAUAUCA 


1897 


GAAACUGGCCAAACCGUUA 


1898



PAK3 



Sense 
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PAK4 



GAUUAUCGCUGCAAAGGAA



GAGAGUGCCUGCAAGCUUU 



1899 



1900 



GACAAGAGGUGGCCAUAAA 


1901


UUAAAUCGCUGUCUUGAGA 


1902 




Sense




ACUAAGAGGUGAACAUGUA 


1903 


GAUCAUGAAUGUCCGAAGA 


1904 


GAUGAGACCCUACUACUGA 


1905 


CAGCAAAGGUGCCAAAGAU 


1906 



PAK6 



Sense 




UAAAGGCAGUUGUCCACUA 


1907 


GAAGGGACCUGCUUUCUUG 


1908 


GCAAAGACGUCCCUAAGAG 


1909 


CCAAUGGGCUGGCUGCAAA 


1910 



PAK7 







GAGCACGGCUUUAAUAAGU 


1911 


CAAACUCCGUUAUGAUAUA 


1912 


GGAUAAAGUUGUCUGAUUU 


1913 


GGAAAUGCCUCCAUAAAUA 


1914 



HDAC1 



Sense 




GGACAUCGCUGUGAAUUGG 


1915 


AGAAAGAAGUCACCGAAGA 


1916 


GGACAAGGCCACCCAAUGA 


1917 


CCACAGCGAUGACUACAUU 


1918 



HDAC2 



Sense 




GCUGUUAAAUUAUGGCUUA 


1919 


GCAAAGAAAGCUAGAAUUG


1920 


CAUCAGAGAGUCUUAUAUA 


1921 


CCAAUGAGUUGCCAUAUAA 


1922 



CREBBP 



Sense




GGCCAUAGCUUAAUUAAUC 


1923 


GCACAGCCGUUUACCAUGA 


1924 


GGACAGCCCUUUAGUCAAG


1925 


GAACUGAUUCCUGAAAUAA 


1926 



BTRC 



Sense 




CACAUAAACUCGUAUCUUA


1927 


GAGAAGGCACUCAAGUUUA


1928 


AGACAUAGUUUACAGAGAA


1929 


GCAGAGAGAUUUCAUAACU 


1930 



RIPK2 



Sense 




GAACAUACCUGUAAAUCAU 


1931


GGACAUCGACCUGUUAUUA


1932 


UAAAUGAACUCCUACAUAG 


1933 


GGAAUUAUCUCUGAACAUA 


1934 



VAV1 


Sense 






GCAGAAAUACAUCUACUAA 


1935 




GCUAUGAGCUGUUCUUCAA 


1936 



CGACAAAGCUCUACUCAUC 


1937


GCUCAACCCUGGAGACAUU 


1938




Sense 




GGACAAGACUCGCAGAUUU 


1939 


GCUGAGCGCUUUGCAAUAA 


1940 


CAAGAAGUCUCACGGGAAA 


1941 


UCACAGAGGCCAAGAAAUU 


1942 




Sense




UGGAAGCCAUCGCCAAAUA 


1942 


CAUCAGUGCAUGACGUUUA 


1943 


UGAAUGAGCUGGUGGAUUA 


1944 


UGCCAAAACUUACCUAUAA 


1945 




Sense 




GAGCUGCACUCCAAUGAGA 


1946 


GAAACCAAGCCAUUAAUGA


1947 


CCAAGGAGCUACUGACAUU 


1948 


AGAGAAACAUGGCCCAAUA


1949 




Sense




CCACAGACAUUUACAUUAA 


1950 


GAAGGGAGUUUGCUAAAUU 


1951 


GAACAGAUCUGAUGAAUGA 


1952 


CAAGAGAGCUGAAGACUAU 


1953 




Sense




GCAUAUAUAUUCAGCAUUG 


1954 


CAACUUGACUGCAGUAUUG 


1955 


GAACUUAACUUUCCAUGUU 


1956 


GACAAGACCUGUAGUAAUU 


1957 




Sense 




AGAAAGAGCUUGACAGUAA 


1958 


GGAAGUAGUUCACAAAAUA


1959 


UGAAGUAUCUGUAUCCAAA 


1960 


GAGCUUCACUCCCUUAGUU


1961




Sense


UAAGGACUCUGAAGAUGUA 


1962 


GACAAAGUGUGUAAUUAUG 


1963 


GCUCAGGACUUAGCAAGAA


1964 


GAAACUGAAUACCUAAGAU 


1965 


GAAACUGAAUACCUAAGAU 


1966 


UAAGGACUCUGAAGAUGUA 


1967 


GACAAAGUGUGUAAUUAUG 


1968 


GCUCAGGACUUAGCAAGAA 


1969 




Sense 




CCAUCCAGCUGAUCCAGAA 


1970 


GAACCCUCCUGAUGAGAGU 


1971 


GAGGACAUCCACCAGUACA 


1972 




Sense 




GAUUAGAGACCAAGGAUUU 
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ELK1 



RALGDS 



PRKCA 



MAPK8 



MAPK9 



AIF1 



MAP2K4 



MAP2K7 



CCACUGAUGUGUGUUAAUU 


1973 


CAAUAG^CCUGUCAAUAU 


1974 


GAAGACAGGAAUCGAAUGA 


1975 




Sense




GAUGUGAGUAGAAGAGUUA


1975 


GGAAGAAUUUGUACCAUUU


1976 


GAACGACCUUUCUUUCUUU 


1977 


GGAGUCAUCUCUUCCUAUA 


1978 




Sense




GGAGAAGCCUCACCUCUUG 


1979 


GCAGAAAGGACUCAAGAUU 


1980 


GAGAACAACUACUCAUUGA 


1981 


GAACUUCUCGUCACUGUAU 


1982 




Sense




GGAUUGUUCUUUCUUCAUA 


1983 


GAAGGGUUCUCGUAUGUCA 


1984 


GAAGAAGGAUGUGGUGAUU 


1985 


GGACUGGGAUCGAACAACA 


1986 






Sense 




GGACAGAAGUGGAAAUAUU 


1987 


UCAAAGAGGUGAACAUUAA


1988 


GACCAAAUCUCAGUUGUUU 


1989 


GGAGAAUGGUGCUGUUUAA 


1990 




Sense




GAAGAGACCAAAGUAUAAU


1991 


GAAGACCGGCCACGUCAUU 


1992 


GGAAGAGACCAAAGUAUAA 


1993 


GCAUUGAGAUUGACCAGAA 


1994


UGAGAGAACGAGAAAGUUG


1995 


GUGAAACCCUGUCUGCAUU 


1996 


GGAUCUCUCUCAACAACUA 


1997 


ACAACUAGGUGAACACAUA 


1998 




Sense




UCACAGUCCUGAAACGAUA 


1999 


GAUUGGAGAUUCUACAUUC


2000 


GCUCAUGGAUGCAAAUCUU 


2001 


GAAGCUAAGCCGACCAUUU


2002 




Sense




AAAGAGAGCUUAUCGUGAA


2003 


GAUGAUAGGUUAGAAAUAG


2004 


ACAAAGAAGUCAUGGAUUG 


2005 


GGAGCUGGAUCAUGAAAGA 


2006 




Sense 




GAAAAGGGAUGAUGGGAUU 


2007 


CCUAGACGAUCCCAAAUAU 


2008 


GAGCCAAACCAGGGAUUUA


2009 


UGAAACGAAUGCUGGAGAA 


2010 


UCACUCACCCAGAGAAAUA


2011 
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BBC3 



BCL2L1 



BCL2L11



BID 



BIRC2 



BIRC3 



BIRC4 



BIRC5 
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CCAAGAAAGCUAUCUCUGA 2012 


AGACUCACCUAGAGCUAAA 


2013 




Sense




CCUGGAGGGUCCUGUACAA 


2014 


GAGCAAAUGAGCCAAACGU


2015 


GGAGGGUCCUGUACAAUCU 


2016 


GACUUUCUCUGCACCAUGU 


2017 




Sense




CCAGGGAGCUUGAAAGUUU 


2018 


aaagDgcaguucaguaaua 


2019 


GAGAAUCACUAACCAGAGA 


2020 


GAGCCCAUCCCUAUUAUAA 


2021 




Sense




GAGACGAGUUUAACGCUUA


2022 


AAAGCAACCUUCUGAUGUA 


2023 


CCGAGAAGGUAGACAAUUG


2024 


GCAAAGCAACCUUCUGAUG 


2025 


AGACAGAGCCACAAGGUAA 


2026 


GCAAGGAGGUUAGAGAAAU


2027 


CAAGGAGGUUAGAGAAAUA


2028 


UCUUACGACUGUUACGUUA


2029




Sense




GAAGACAUCAUCCGGAAUA 


2030 


CAACAGCGUUCCUAGAGAA


2031 


GAAAUGGGAUGGACUGAAC 


2032 


ACGAUGAGCUGCAGACUGA 


2033 




Sense 




GAAAGAAGCCUGCAUAUAA 


2034 


GAAAUUGACUCUACAUUGU 


2035 


ACAAAUAGCACUUAGGUUA


2036 


GAAUACACCUGUGGUUAAA


2037 




Sense 




GGAGAUGCCUGCCAUUAAA 


2038 


UCAAUGAUCUUGUGUUAGA 


2039 


GAAAGAACAUGUAAAGUGU


2040 


GAAGAAAGAACAUGUAAAG 


2041 




Sense




GUAGAUAGAUGGCAAUAUG 


2042 


GAGGAGGGCUAACUGAUUG 


2043 


GAGGAACCCUGCCAUGUAU 


2044 


GCACGGAUCUUUACUUUUG 


2045 




Sense 




GGCGUAAGAUGAUGGAUUU 


2046 


GCAAAGGAAACCAACAAUA 


2047 


GCACAAAGCCAUUCUAAGU 


2048 


CAAAGGAAACCAACAAUAA


2049 



BRCA1 



Sense 
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CARD4 



CASP10 



CASP2 



CDKN1A 



CFLAR 



CLK2 



CLSPN 



CSNK2A1



CCAUACAGCUUCAUAAAUA 


2050 


GAAGAGAACUUAUCUAGUG 


2051 


GAAGUGGGCUCCAGUAUUA 


2052


GCAAGAUGCUGAUUCAUUA


2053


CCAUACAGCUUCAUAAAUA 2054




Sense




GAAAGUUAAUGUCAAGGAA 


2055 


GAGCAACACUGGCAUAACA 


2056 


UAACAGAGAUUUGCCUAAA 


2057 


GCGAAGAGCUGACCAAAUA 


2058 




Sense




CAAAGGGUUUCUCUGUUUA 


2059 


GAAAUGACCUCCCUAAGUU 


2060 


GAAGGCAGCUGGUAUAUUC 


2061 


GACAUGAUCUUCCUUCUGA 


2062 


GCACUCUUCUGUUCCCUUA 


2063




Sense




GUAUUAAACUCUCCUUUGA 


2064 


GCAAGGAGAUGUCUGAAUA 


2065 


CAACUUCCCUGAUCUUUAA 


2066 


GCUCAAAGAUGUAAUGUAG 


2067 




Sense




GAACAAGGAGUCAGACAUU


2068 


AAACUAGGCGGUUGAAUGA 


2069 


GAUGGAACUUCGACUUUGU 


2070 


GUAAACAGAUGGCACUUUG 


2071 




Sense




GAUGUGUCCUCAUUAAUUU 


2072 


GAAGAGAGAUACAAGAUGA


2073 


GAGCAUACCUGAAGAGAGA 


2074 


GCUAUGAAGUCCAGAAAUU 


2075 




Sense




GUGAAUAUGUGAAAUAGUG 


2076


AAAGCAUGCUAGAGUAUGA 


2077 


UUAAGAAUGUGGAGAAGUA


2078 


GAUAACAAGCUGACACAUA 


2079 




Sense




GGACGUAAUUGAUGAAGUA


2080 


GCAGAUGGGUUCUUAAAUG


2081 


CAAAUGAGGUUGAGGAAAU 


2082 


GGAAAUACCUGGAGGAUGA 


2083 




Sense 




GAUCCACGUUUCAAUGAUA 


2084 


GCAUUUAGGUGGAGACUUC 


2085 


GAUGUACGAUUAUAGUUUG


2086 


UGAAUUAGAUCCACGUUUC 


2087 
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CTNNB1 



CXCR4 



CXCR6 



DAXX 



GAS41 



GTSE1 



Sense 




GCACAAGAAUGGAUCACAA 


2088 


GCUGAAACAUGCAGUUGUA 


2089 


GUACGUACCAUGCAGAAUA


2090 


GAACUUGCAUUGUGAUUGG 


2091 




Sense




GAAGCAUGACGGACAAGUA


2092 


GAACAUUCCAGAGCGUGUA 


2093 


GUUCUUAGUUGCUGUAUGU 


2094 


CAUCAUGGUUGGCCUUAUC 


2095 




Sense




GGAACAAACUGGCAAAGCA 


2096 


GAUCAGAGCAGCAGUGAAA 


2097 


GGGCAAAACUGAAUUAUAA


2098


GAUCUCAGGUUCUCCUUGA 


2099 




Sense




CUACAGAUCUCCAAUGAAA 


2100 


GCUACAAGCUGGAGAAUGA 


2101 


GGAAACAGCUAUGUGGAAA 


2102 


GGAGUUGGAUCUCUCAGAA 


2103 




Sense




GUAGUAAGCUAAACUGAAA 


2104 


GACAAUAUGUUCAAGAGAA 


2105 


GACAACAUCUCGUCAGCUA 


2106 


UAUAUGAUGUGUCCAGUAA 2107




Sense




CAAAGAAGCUCACUUACUG


2108 


GAACAGCCCUAAAGUGGUU


2109 


GAACAUGGAUGACCCUAAG 


2110 


GGGCAAAGCUAAAUCAAGU 


2111 



HDAC3 



Sense






GGAAAGCGAUGUGGAGAUU 


2112


CCAAGACCGUGGCCUAUUU 


2113 




AAAGCGAUGUGGAGAUUUA 


2114 


GUGAGGAGCUUCCCUAUAG


2115 



HDAC5 



HEC 



Sense




GAAUUCCUCUUGUCGAAGU 


2116 


GUUAUUAGCACCUUUAAGA 


2117 


GGAGGGAGGCCAUGACUUG 


2118 


CAGGAGAGCUCAAGAAUGG 


2119 


GGAUAUGGAUUUCAGUUAA 


2120 


GGAAGUCGGUGCCUUGGUU 


2121 


GGAAGGAGAGGACUGGUUU 


2122 




Sense 




GCAGAUACUUGCACGGUUU 


2123 


GAGUAGAACUAGAAUGUGA 


2124 


GCGAAUAAAUCAUGAAAGA 


2125 


GAAGAUGGAAUUAUGCAUA 


2126 
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HIST1H2 


Sense 




AA 


GGCAAUGCGUCUCGCGAUA 


2127 




GAUCCGCAAUGAUGAGGAA 


2128 




GCAAUGCGUCUCGCGAUAA 


2129 




GAGGAACUCAAUAAGCUUU 


2130 



LMNB1 



Sense 




AAUAGAAGCUGUGCAAUUA 


2131 


CAACUGACCUCAUCUGGAA 


2132 


GAAGGAAUCUGAUCUUAAU 


2133 


GGGAAGGGUUUCUCUAUUA 


2134 



LMNB2 



Sense




GGAGGUUCAUUGAGAAUUG 


2134 


GGCAAUAGCUCACCGUUUA 


2135 


CAAAUACGCUUAGCUGUGU 


2136 


GGAGAUCGCCUACAAGUUC


2137 



MYB 



Sense 




GCAGAAACACUCCAAUUUA 


2138 


IguaaauacgugaaOgcauu 


2139 


GCACUGAACUUUUGAGAUA 


2140 


GAAGAACAGUCAUUUGAUG


2141 



MYT1 



Sense




GAGGUGAGCUGUUAAAUCA 


2142 


GCAGGGUGAUUUCCUAAUA


2143 


GGGAGAAGAUAUUUAAUUG 


2144 


CAACUUCUCUCCUGAACUU 


2145 



NFKBIB 



Sense




GGACACGGCACUGCACUUG 


2146 


GCACUUGGCUGUGAUUCAU 


2148 


GAGACGAGGGCGAUGAAUA 


2149 


CAUGAACCCUUCCUGGAUU 


2150 



NFKBIA 



Sense






GAACAUGGACUUGUAUAUU 


2151 




GAUGUGGGGUGAAAAGUUA 


2152 


GGACGAGAAAGAUCAUUGA


2153 


AGGACGAGCUGCCCUAUGA 


2154 



NFKBIE 



NUMA1 



Sense 




GAAGGGAAGUUUCAGUAAC


2155 


GGAAGGGAAGUUUCAGUAA 


2156 


GGAAACUGCUGCUGUGUAC 


2157 


GAACCAACCACUCAUGGAA 


2158 




Sense




GGGAACAGUUUGAAUAUAA


2159 


GCAGUAGCCUGAAGCAGAA


2160 


CGAGAAGGAUGCACAGAUA 


2161 


GCAACBAGGCUGAGAGGAAA 


2162 



NUP153 



Sense 
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GAAGACAAAUGAAAGCUAA 


2163 


GAUAAAGACUGCUGUUAGA 


2164 


GAGGAGAGCUCUAAUAUUA 


2165 


GAGGAAGCCUGAUUAAAGA 


2166 




Sense


GAAAGAGCAUGAUGACAUA


2167 


GAGGAGAGCUCUAUUAUGU 


2168 


GAAACUGAAUGGAAGAAUA 


2169 


AAAGAAGGCUGUACCGUUA 


2170 




Sense




CUACAUGUCUUUGCUCUUA 


2171 


GCUAAGUCCUGUAAGAAUA 


2172 


CAAAGGCAAUGUACUGUUU 


2173 


GAACAAUGGUGGAUCCAAA 


2174 




Sense 




AAGUUCAGCUUCUCUAUUA 


2175 


GAAGAAAUCUCUGAUGGAU 


2176 


GAACACCUUUACUCUAUAA 


2177 


GCAUGGAGCUGGAGAACUA 


2178 




Sense 




GAUGAAAGCUCUAAAGAUG 


2179 


GAAAGGAGGUUCUAAACUA 


2180 


GGAAGAAGCUCAUUUGAUU 


2181 


GCAAAGAGGUGGCAGUUAA 


2182 




Sense 




GGAAGAAGAUCCACAUGAA 


2183 


GAACAUACUUUCAGAGCUU 


2184 


GAACAAUCUUUGCUGUAUA 


2185 


UAACAGAACUGCUUCAACA 


2186 



SLC9A1 



Sense




GAAGAGAUCCACACACAGU 


2187 


ucaaugagcugcugcacaO 


2188 


GAAGAUAGGUUUCCAUGUG


2189 


GAAUUACCCUUCCUCAUCU 


2190 



TEGT 



Sense




CUACAGAGCUUCAGUGUGA 


2191 


GAACAUAUUUGAUCGAAAG


2192 


GAGCAAACCUAGAUAAGGA 


2193 


GCAUUGAUCUCUUCUUAGA 


2194 



TERT 



Sense 




GGAAGACAGUGGUGAACUU 


2195 


GCAAAGCAUUGGAAUCAGA 


2196 


GAGCUGACGUGGAAGAUGA


2197 


GAACGGGCCUGGAACCAUA 


2198 



TNFRSF6


Sense 






GAUACUAACUGCUCUCAGA 


2199 
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TOP1 



TOP2A 



TOP3A 



TOP3B 



WEE1 



GAAAGAAUGGUGUCAAUGA 


2200 


UCAAUAAUGUCCCAUGUAA 


2201 


UCAUGAAUCUCCAACCUUA 


2202 


GAUGUUGACUUGAGUAAAU 


2203 




Sense




GAAAGGAAAUGACUAAUGA 


2204 


GAAGAAGGCUGUUCAGAGA 


2205 


GGAAGUAGCUACGUUCUUU 


2206 


GGACAUAAGUGGAAAGAAG


2207 




Sense




GAAAGAGUCCAUCAGAUUU 


2208 


CAAACUACAUUGGCAUUUA 


2209 


AAACAGACAUGGAUGGAUA 


2210


CGAAAGGAAUGGUUAACUA 


2211 




Sense




CCAGAAAUCUUCCACAGAA 


2212 


GAAACUAUCUGGAUGUGUA 


2213


CCACAAAGAUGGUAUCGUA 


2214 


GGAAAUGGCUGUGGUAACA 


2215




Sense




GAGACAAGAUGAAGACUGU


2216 


GCACAUGGGCUGCGUCUUU 


2217 


CCAGUGCGCUUCAAGAUGA 


2218 


GAACAUCUGCUUUGAGGUU 


2219 




Sense




GGUAUUGCCUUGUGAAUUU 


2220 


GCAGAACAAUUACGAAUAG 


2221 


GUACAUAGCUGUUUGAAAU 


2222 


GCUGUAAACUUGUAGCAUU 


2223 



In addition to identifying functional siRNA against gene families or
pathways, it is possible to design duplexes against genes known to be involved in
specific diseases. For example, when dealing with human disorders associated with
allergies, it will be beneficial to develop siRNA against a number of genes including,
but not limited to:
the interleukin 4 receptor gene

(SEQ. ID NO. 2224: UAGAGGUGCUCAUUCAUUU,
SEQ. ID NO. 2225: GGUAUAAGCCUUUCCAAGA,
SEQ. ID NO. 2225: ACACACAGCUGGAAGAAAU,
SEQ. ID NO. 2226: UAACAGAGCUUCCUUAGGU),



the Beta-arrestin-2 
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(SEQ. ID NO. 2227: GGAUGAAGGAUGACGACUA, 
SEQ. ID NO. 2228: ACACCAACCUCAUUGAAUU, 
SEQ. ID NO. 2229: CGAACAAGAUGACCAGGUA, 
SEQ. ID NO. 2230: GAUGAAGGAUGACGACUAU),

the interferon-gamma receptor 1 gene
(SEQ. ID NO. 2231: CAGCAUGGCUCUCCUCUUU,
SEQ. ID NO. 2232: GUAAAGAACUAUGGUGUUA,
SEQ. ID NO. 2233: GAAACUACCUGUUACAUUA,
SEQ. ID NO. 2234: GAAGUGAGAUCCAGUAUAA),

the matrix metalloproteinase MMP-9 
(SEQ. ID NO. 2235: GGAACCAGCUGUAUUUGUU, 
SEQ. ID NO. 2236: GUUGGAGUGUUUCUAAUAA, 
SEQ. ID NO. 2237: GCGCUGGGCUUAGAUCAUU,
SEQ. ID NO. 2238: GGAGCCAGUUUGCCGGAUA),

the Slc11a1 (Nramp1) gene

(SEQ. ID NO. 2239: CCAAUGGCCUGCUGAACAA,
SEQ. ID NO. 2240: GGGCCUGGCUI5CCUCAUGA,
SEQ. ID NO. 2241: GGGCAGAGCUCCACCAUGA, 
SEQ. ID NO. 2242: GCACGGCCAUUGCAUUCAA), 

SPINK5

(SEQ. ID NO. 2243: CCAACUGCCUGUUCAAUAA,
SEQ. ID NO. 2244: GGAUACAUGUGAUGAGUUU,
SEQ. ID NO. 2245: GGACGAAUGUGCUGAGUAU,
SEQ. ID NO. 2246: GAGCUUGUCUUAUUUGCUA),

the CYP1A2 gene

(SEQ. ID NO. 2247: GAAAUGCUGUGUCUUCGUA, 
SEQ. ID NO. 2248: GGACAGCACUUCCCUGAGA, 
SEQ. ID NO. 2249: GAAGACACCACCAUUCUGA, 
SEQ. ID NO. 2250: GGCCAGAGCUUGACCUUCA), 
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thymosin-beta4Y 

(SEQ. ID NO. 2251: GGACAGGCCUGCGUUGUUU,
SEQ. ID NO. 2252: GGAAAGAGGAAGCUCAUGA,
SEQ. ID NO. 2253: GCAAACACGUUGGAUGAGU,
SEQ. ID NO. 2254: GGACUAUGCUGCCCUUUUG),

activin A receptor IB 

(SEQ. ID NO. 2255: ACAAGACGCUCCAGGAUCU, 
SEQ. ID NO. 2254: GCAACAGGAUCGACUUGAG,
SEQ. ID NO. 2255: GAAGCUGCGUCCCAACAUC,
SEQ. ID NO. 2256: GCAUAGGCCUGUAAUCGUA,
SEQ. ID NO. 2257: UCAGAGAGUUCGAGACAAA,
SEQ. ID NO. 2258: UGCGAAAGGUUGUAUGUGA,
SEQ. ID NO. 2259: GCAACAGGAUCGACUUGAG,
SEQ. ID NO. 2260: GAAUAGCGUUGUGUGUUAU,
SEQ. ID NO. 2261: UGAAUAGCGUUGUGUGUUA,
SEQ. ID NO. 2262: GGGAUCAGUUUGUUGAAUA,
SEQ. ID NO. 2263: GAGCCUGAAUCAUCGUUUA),

ADAM33 

(SEQ. ID NO. 2264: GGAAGUACCUGGAACUGUA,
SEQ. ID NO. 2265: GGACAGAGGGAACCAUUUA,
SEQ. ID NO. 2266: GGUGAGAGGUAGCUCCUAA,
SEQ. ID NO. 2267: AAAGACAGGUGGCCACUGA),

the TAP1 gene

(SEQ. ID NO. 2268: GAAAGAUGAUCAGCUAUUU,
SEQ. ID NO. 2269: CAACAGAACCAGACAGGUA,
SEQ. ID NO. 2270: UGAGAAAUGUUCAGAAUGU,
SEQ. ID NO. 2271: UACCUUCACUCGAAACUUA),

COX-2 

(SEQ. ID NO. 2272: GAACGAAAGUAAAGAUGUU,
SEQ. ID NO. 2273: GGACUUAUGGGUAAUGUUA,
SEQ. ID NO. 2274: UGAAAGGACUUAUGGGUAA, 
SEQ. ID NO. 2275: GAUCAGAGUUCACUUUCUU), 

ADPRT

(SEQ. ID NO. 2276: GGAAAGAUGUUAAGCAUUU,
SEQ. ID NO. 2277: CAUGGGAGCUCUUGAAAUA,
SEQ. ID NO. 2278: GAACAAGGAUGAAGUGAAG,
SEQ. ID NO. 2279: UGAAGAAGCUCACAGUAAA),

HDC

(SEQ. ID NO. 2280: CAGCAGACCUUCAGUGUGA,
SEQ. ID NO. 2281: GGAGAGAGAUGGUGGAUUA,
SEQ. ID NO. 2282: GUACAGAGCUGGAGAUGAA,
SEQ. ID NO. 2283: GAACGUCCCUUCAGUCUGU),

HnmT 

(SEQ. ID NO. 2284: CAAAUUCUCUCCAAAGUUC, 
SEQ. ID NO. 2285: GGAUAUAUCUGACUGCUUU, 
SEQ. ID NO. 2286: GAGCAGAGCUUGGGAAAGA,
SEQ. ID NO. 2287: GAUAUGAGAUGUAGCAAAU), 

GATA-3 

(SEQ. ID NO. 2288: GAACUGCUUUCUUUCGUUU, 
SEQ. ID NO. 2289: GCAGUAUCAUGAAGCCUAA,
SEQ. ID NO. 2290: GAAACUAGGUCUGAUAUUC, 
SEQ. ID NO. 2291: GUACAGCUCCGGACUCUUC), 

Gab2 

(SEQ. ID NO. 2292: GCACAACCAUUCUGAAGUU,
SEQ. ID NO. 2293: GGACUUAGAUGCCCAGAUG, 
SEQ. ID NO. 2294: GAAGGUGGAUUCUAGGAAA, 
SEQ. ID NO. 2295: GGACUAGCCCUGCUGUUUA), and 
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STAT6 

(SEQ. ID NO. 2296: GAUAGAAACUCCUGCUAAU, 
SEQ. ID NO. 2297: GGACAUUUAUUCCCAGCUA, 
SEQ. ID NO. 2298: GGACAGAGCUACAGACCUA, 
SEQ. ID NO. 2299: GGAUGGCUCUCCACAGAUA).
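
By way of illustration only, the relationship between a listed sense sequence and the complementary strand of a duplex can be sketched in Python as follows. The helper names are hypothetical and are not part of the present disclosure; the sketch assumes a plain 19-nucleotide RNA sense strand and ignores overhangs and chemical modifications.

```python
# Illustrative sketch; function names are assumptions, not part of the disclosure.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def is_valid_sense(seq: str, length: int = 19) -> bool:
    """Return True if seq has exactly `length` bases drawn from A, C, G, U."""
    return len(seq) == length and set(seq) <= set(COMPLEMENT)


def antisense(seq: str) -> str:
    """Return the reverse complement of an RNA strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq."""
    return sum(base in "GC" for base in seq) / len(seq)


# SEQ. ID NO. 2296 (STAT6), copied from the list above.
sense = "GAUAGAAACUCCUGCUAAU"
print(is_valid_sense(sense), antisense(sense), round(gc_content(sense), 2))
```

Taking the reverse complement twice returns the original strand, which offers a simple check that a sequence was transcribed without error.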

In addition, rationally designed siRNA or siRNA pools can be directed against
genes involved in anemia, hemophilia or hypercholesterolemia. Such genes would
include, but are not limited to:
APOA5

(SEQ. ID NO. 2300: GAAAGACAGCCUUGAGCAA,
SEQ. ID NO. 2301: GGACAGGGAGGCCACCAAA,
SEQ. ID NO. 2302: GGACGAGGCUUGGGCUUUG,
SEQ. ID NO. 2303: AGCAAGACCUCAACAAUAU),

HMG-CoA reductase 

(SEQ. ID NO. 2304: GAAUGAAGCUUUGCCCUUU, 
SEQ. ID NO. 2305: GAACACAGUUUAGUGCUUU, 
SEQ. ID NO. 2306: UAUCAGAGCUCUUAAUGUU, 
SEQ. ID NO. 2307: UGAAGAAUGUCUACAGAUA),

NOS3 

(SEQ. ID NO. 2308: UGAAGCACCUGGAGAAUGA,
SEQ. ID NO. 2309: CGGAACAGCACAAGAGUUA,
SEQ. ID NO. 2310: GGAAGAAGACCUUUAAAGA,
SEQ. ID NO. 2309: GCACAAGAGUUAUAAGAUC),

ARH 

(SEQ. ID NO. 2310: CGAUACAGCUUGGCACUUU, 
SEQ. ID NO. 2311: GAGAAGCGCUGCCCUGUGA,
SEQ. ID NO. 2312: GAAUCAUGCUGUUCUCUUU, 
SEQ. ID NO. 2313: GGAGUAACCGGACACCUUA), 



CYP7A1 
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(SEQ. ID NO. 2314: UAAGGUGACUCGAGUGUUU, 
SEQ. ID NO. 2315: AAACGACACUUUCAUCAAA, 
SEQ. ID NO. 2316: GGACUCAAGUUAAAGUAUU, 
SEQ. ID NO. 2317: GUAAUGGACUCAAGUUAAA), 


FANCA 

(SEQ. ID NO. 2318: GGACAUCACUGCCCACUUC, 
SEQ. ID NO. 2319: AGAGGAAGAUGUUCACUUA, 
SEQ. ID NO. 2320: GAUCGUGGCUCUUCAGGAA, 
SEQ. ID NO. 2321: GGACAGAGGCAGAUAAGAA),

FANCG 

(SEQ. ID NO. 2322: GCACUAAGCAGCCUUCAUG, 
SEQ. ID NO. 2323: GCAAGCAGGUGCCUACAGA, 
SEQ. ID NO. 2324: GGAAUUAGAUGCUCCAUUG,
SEQ. ID NO. 2325: GGACAUCUCUGCCAAAGUC), 

ALAS 

(SEQ. ID NO. 2326: CAAUAUGCCUGGAAACUAU, 
SEQ. ID NO. 2327: GGUUAAGACUCACCAGUUC,
SEQ. ID NO. 2328: CAACAGGACUUUAGGUUCA, 
SEQ. ID NO. 2329: GCAUAAGAUUGACAUCAUC), 



PIGA 

(SEQ. ID NO. 2330: GAAAGAGGGCAUAAGGUUA,
SEQ. ID NO. 2331: GGACUGAUCUUUAAACUAU, 
SEQ. ID NO. 2332: UCAAAUGGCUUACUUCAUC, 
SEQ. ID NO. 2333: UCUAAGAACUGAUGUCUAA), and 

factor VIII

(SEQ. ID NO. 2334: GCAAAUAGAUCUCCAUUAC, 
SEQ. ID NO. 2335: CCAGAUAUGUCGUUCUUUA, 
SEQ. ID NO. 2336: GAAAGGCUGUGCUCUCAAA, 
SEQ. ID NO. 2337: GGAGAAACCUGCAUGAAAG, 
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SEQ. ID NO. 2338: CUUGAAGCCUCCUGAAUUA, 
SEQ. ID NO. 2339: GAGGAAGCAUCCAAAGAUU, 
SEQ. ID NO. 2340: GAUAGGAGAUACAAACUUU). 
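
As a hedged sketch of how sequences such as those above might be collected into a pool for a given gene, the following Python fragment normalizes each entry and discards repeats. The helper names and the dictionary layout are assumptions made for illustration and do not describe the pooling approach of the disclosure itself.

```python
# Illustrative sketch; helper names and data layout are assumptions.
def normalize(seq: str) -> str:
    """Uppercase, drop whitespace, and write thymine (T) as uracil (U)."""
    return "".join(seq.split()).upper().replace("T", "U")


def make_pool(sequences):
    """Return the unique normalized sequences, keeping first-seen order."""
    pool = []
    for raw in sequences:
        seq = normalize(raw)
        if seq not in pool:
            pool.append(seq)
    return pool


pools = {
    # First two PIGA entries from the list above; the third entry repeats
    # the first in lower case to show that duplicates are dropped.
    "PIGA": make_pool([
        "GAAAGAGGGCAUAAGGUUA",
        "GGACUGAUCUUUAAACUAU",
        "gaaagagggcauaagguua",
    ]),
}
print(pools["PIGA"])
```

Keeping first-seen order means that a pool built this way lists its members in the same order as the source list.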



Furthermore, rationally designed siRNA or siRNA pools can be directed
against genes involved in disorders of the brain and nervous system. Such genes
would include, but are not limited to:
APBB1 

(SEQ. ID NO. 2341: CUACGUAGCUCGUGAUAAG, 
SEQ. ID NO. 2342: GCAGAGAUGUCCACACGUU,
SEQ. ID NO. 2343: CAUGAGAUCUGCUCUAAGA,
SEQ. ID NO. 2344: GGGCACCUCUGCUGUAUUG),

BACE1 

(SEQ. ID NO. 2345: CCACAGAGCAAGUGAUUUA,
SEQ. ID NO. 2346: GCAGAAAGGAGAUCAUUUA,
SEQ. ID NO. 2347: GUAGCAAGAUCUUUACAUA,
SEQ. ID NO. 2348: UGUCAGAGCUUGAUUAGAA),



PSEN1

(SEQ. ID NO. 2349: GAGCUGACAUUGAAAUAUG,
SEQ. ID NO. 2350: GUACAGCUAUUUCUCAUCA,
SEQ. ID NO. 2351: GAGGUUAGGUGAAGUGGUU,
SEQ. ID NO. 2352: GAAAGGGAGUCACAAGACA,
SEQ. ID NO. 2353: GAACUGGAGUGGAGUAGGA,
SEQ. ID NO. 2354: CAGCAGGCAUAUCUCAUUA, 
SEQ. ID NO. 2355: UCAAGUACCUCCCUGAAUG), 



PSEN2 

(SEQ. ID NO. 2356: GCUGGGAAGUGGCUUAAUA,
SEQ. ID NO. 2357: CAUAUUCCCUGCCCUGAUA, 
SEQ. ID NO. 2358: GGGAAGUGCUCAAGACCUA, 
SEQ. ID NO. 2359: CAUAGAAAGUGACGUGUUA),

MASS1

(SEQ. ID NO. 2360: GGAAGGAGCUGUUAUGAGA, 
SEQ. ID NO. 2361: GAAAGGAGAAGCUAAAUUA, 
SEQ. ID NO. 2362: GGAGGAAGGUCAAGAUUUA, 
SEQ. ID NO. 2363: GGAAAUAGCUGAGAUAAUG),

ARX 

(SEQ. ID NO. 2364: CCAGACGCCUGAUAUUGAA, 
SEQ. ID NO. 2365: CAGCACCACUCAAGACCAA, 
SEQ. ID NO. 2366: CGCCUGAUAUUGAAGUAAA,
SEQ. ID NO. 2367: CAACAUCCACUCUCUCUUG) and

NNMT 

(SEQ. ID NO. 2368: GGGCAGUGCUCCAGUGGUA, 
SEQ. ID NO. 2369: GAAAGAGGCUGGCUACACA,
SEQ. ID NO. 2370: GUACAGAAGUGAGACAUAA,
SEQ. ID NO. 2371: GAGGUGAUCUCGCAAAGUU).

In addition, rationally designed siRNA or siRNA pools can be directed against
genes involved in hypertension and related disorders. Such genes would include, but
are not limited to:
angiotensin II type 1 receptor 

(SEQ. ID NO. 2372: CAAGAAGCCUGCACCAUGU,
SEQ. ID NO. 2373: GCACUUCACUACCAAAUGA,
SEQ. ID NO. 2374: GCACUGGUCCCAAGUAGUA,
SEQ. ID NO. 2375: CCAAAGGGCAGUAAAGUUU, 
SEQ. ID NO. 2376: GCUCAGAGGAGGUGUAUUU, 
SEQ. ID NO. 2377: GCACUUCACUACCAAAUGA, 
SEQ. ID NO. 2378: AAAGGGCAGUAAAGUUU), 


AGTR2 

(SEQ. ID NO. 2379: GAACAUCUCUGGCAACAAU, 
SEQ. ID NO. 2380: GGUGAUAUAUCUCAAAUUG, 
SEQ. ID NO. 2381: GCAAGCAUCUUAUAUAGUU, 
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SEQ. ID NO. 2382: GAACCAGUCUUUCAACUCA), and other related targets. 



Example XIII: Validation of Multigene Knockout using Rab5 and Eps 

Two or more genes having similar, overlapping functions often lead to
genetic redundancy. Mutations that knock out only one of, e.g., a pair of such genes
(also referred to as homologs) result in little or no phenotype because the
remaining intact gene is capable of fulfilling the role of the disrupted counterpart. To
fully understand the function of such genes in cellular physiology, it is often
necessary to knock out or knock down both homologs simultaneously. Unfortunately,
concomitant knockdown of two or more genes is frequently difficult to achieve in
higher organisms (e.g., mice); thus it is necessary to introduce new technologies to
dissect gene function. One such approach to knocking down multiple genes
simultaneously is by using siRNA. For example, Figure 11 showed that rationally
designed siRNA directed against a number of genes involved in the clathrin-mediated
endocytosis pathway resulted in significant levels of protein reduction (e.g., >80%).
To determine the effects of gene knockdown on clathrin-related endocytosis,
internalization assays were performed using epidermal growth factor and transferrin.
Specifically, mouse receptor-grade EGF (Collaborative Research Inc.) and
iron-saturated human transferrin (Sigma) were iodinated as described previously
(Jiang, X., Huang, F., Marusyk, A. & Sorkin, A. (2003) Mol Biol Cell 14, 858-70).
HeLa cells grown in 12-well dishes were incubated with 125I-EGF (1 ng/ml) or
125I-transferrin (1 µg/ml) in binding medium (DMEM, 0.1% bovine serum albumin) at
37°C, and the ratio of internalized to surface radioactivity was determined during a
5-min time course to calculate the specific internalization rate constant ke as
described previously (Jiang, X. et al.). The measurements of the uptake of
radiolabeled transferrin and EGF were performed using short time-course assays to
avoid the influence of recycling on the uptake kinetics, and using low ligand
concentrations to avoid saturation of the clathrin-dependent pathway (for EGF, Lund,
K. A., Opresko, L. K., Starbuck, C., Walsh, B. J. & Wiley, H. S. (1990) J. Biol. Chem.
265, 15713-15723).
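The rate constant ke referred to above can be illustrated with a short calculation. This is a minimal sketch, assuming that ke is taken as the least-squares slope of the internalized-to-surface ratio plotted against time; that reading is an assumption made for illustration and is not a procedure stated in this disclosure. The function name and the example counts are hypothetical.

```python
from typing import Sequence


def internalization_rate_constant(times_min: Sequence[float],
                                  internalized_cpm: Sequence[float],
                                  surface_cpm: Sequence[float]) -> float:
    """Return the least-squares slope of (internalized / surface) versus time.

    The result has units of 1/min. Treating this slope as ke is an
    assumption of this sketch.
    """
    if not (len(times_min) == len(internalized_cpm) == len(surface_cpm)):
        raise ValueError("all inputs must have the same length")
    if len(times_min) < 2:
        raise ValueError("at least two time points are required")

    # Ratio of internalized to surface radioactivity at each time point.
    ratios = [i / s for i, s in zip(internalized_cpm, surface_cpm)]

    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_r = sum(ratios) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (r - mean_r) for t, r in zip(times_min, ratios))
    return sxy / sxx


# Hypothetical counts over a 5-min time course (not data from this disclosure).
ke = internalization_rate_constant(
    times_min=[1, 2, 3, 4, 5],
    internalized_cpm=[100, 200, 300, 400, 500],
    surface_cpm=[1000, 1000, 1000, 1000, 1000],
)
```

Restricting the fit to early time points mirrors the short time course described above, which is intended to keep recycling from distorting the uptake kinetics.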






The effects of knocking down Rab5a, 5b, 5c, Eps, or Eps15R (individually)
are shown in Figure 22 and demonstrate that disruption of single genes has little or no
effect on EGF or Tfn internalization. In contrast, simultaneous knockdown of Rab5a,
5b, and 5c, or Eps and Eps15R, leads to a distinct phenotype (note: the total
concentration of siRNA in these experiments was the same as in experiments in
which a single siRNA was introduced; see Figure 23). These experiments demonstrate
the effectiveness of using rationally designed siRNA to knock down multiple genes
and validate the utility of these reagents to override genetic redundancy.

Example XIV. Validation of Multigene Targeting Using G6PD, GAPDH, PLK, 
and UQC. 

Further demonstration of the ability to knock down expression of multiple
genes using rationally designed siRNA was performed using pools of siRNA directed
against four separate genes. To achieve this, siRNA were transfected into cells (total
siRNA concentration of 100 nM) and assayed twenty-four hours later by B-DNA.
The results in Figure 24 show that pools of rationally designed molecules are
capable of simultaneously silencing four different genes.

Example XV. Validation of Multigene Knockouts As Demonstrated by Gene 
Expression Profiling, a Prophetic Example 

To further demonstrate the ability to concomitantly knock down the expression
of multiple gene targets, single siRNA or siRNA pools directed against a collection of
genes (e.g., 4, 8, 16, or 23 different targets) are simultaneously transfected into cells
and cultured for twenty-four hours. Subsequently, mRNA is harvested from treated
(and untreated) cells and labeled with one of two fluorescent dyes (e.g., a red
fluorescent probe for the treated cells, a green fluorescent probe for the control cells).
Equivalent amounts of labeled RNA from each sample are then mixed together and
hybridized to sequences that have been linked to a solid support (e.g., a slide, "DNA
CHIP"). Following hybridization, the slides are washed and analyzed to assess
changes in the levels of target genes induced by siRNA.

Example XVI. Identifying Hyperfunctional siRNA



Identification of Hyperfunctional Bcl-2 siRNA 

The ten rationally designed Bcl-2 siRNA (identified in Figures 13 and 14) were
tested to identify hyperpotent reagents. To accomplish this, each of the ten Bcl-2
siRNA was individually transfected into cells at a concentration of 300 pM (0.3 nM).
Twenty-four hours later, transcript levels were assessed by B-DNA assays and
compared with relevant controls. As shown in Figure 25, while the majority of Bcl-2
siRNA failed to induce functional levels of silencing at this concentration, siRNA 1
and 8 induced >80% silencing, and siRNA 6 exhibited greater than 90% silencing at
this subnanomolar concentration.



By way of prophetic examples, similar assays could be performed with any of
the groups of rationally designed siRNA described in Example VII or Example VIII.
Thus, for instance, rationally designed siRNA sequences directed against
PDGFA 

(SEQ. ID NO. 2383: GGUAAGAUAUUGUGCUUUA, 
SEQ. ID NO. 2384: CCGCAAAUAUGGAGAAUUA, 
SEQ. ID NO. 2385: GGAUGUACAUGGCGUGUUA, 
SEQ. ID NO. 2386: GGUGAAGUUUGUAUGUUUA), or



PDGFB 

(SEQ. ID NO. 2387: GCUCCGCGCUUUCCGAUUU, 
SEQ. ID NO. 2388: GAGCAGGAAUGGUGAGAUG, 
SEQ. ID NO. 2389: GAACUUGGGAUAAGAGUGU,
SEQ. ID NO. 2390: CCGAGGAGCUUUAUGAGAU, 
SEQ. ID NO. 2391: UUUAUGAGAUGCUGAGUGA) 

could be introduced into cells at increasingly limiting concentrations to determine
whether any of the duplexes are hyperfunctional. Similarly, rationally designed
sequences directed against
HIF1 alpha 

(SEQ. ID NO. 2392: GAAGGAACCUGAUGCUUUA, 
SEQ. ID NO. 2393: GCAUAUAUCUAGAAGGUAU, 
SEQ. ID NO. 2394: GAACAAAUACAUGGGAUUA, 
SEQ. ID NO. 2395: GGACACAGAUUUAGACUUG), or

VEGF 

(SEQ. ID NO. 2396: GAACGUACUUGCAGAUGUG, 
SEQ. ID NO. 2397: GAGAAAGCAUUUGUUUGUA, 
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SEQ. ID NO. 2398: GGAGAAAGCAUUUGUUUGU, 

SEQ. ID NO. 2399: CGAGGCAGCUUGAGUUAAA) could be introduced into cells 
at increasingly limiting concentrations and screened for hyperfunctional duplexes. 



Example XVII: Gene Silencing: Prophetic Example

Below is an example of how one might transfect a cell. 

a. Select a cell line. The selection of a cell line is usually determined by the
desired application. The most important feature for RNAi is the level of
expression of the gene of interest. It is highly recommended to use cell lines
for which siRNA transfection conditions have been specified and validated.



b. Plate the cells. Approximately 24 hours prior to transfection, plate the cells at
the appropriate density so that they will be approximately 70-90% confluent,
or approximately 1 x 10^5 cells/ml, at the time of transfection. Cell densities
that are too low may lead to toxicity due to excess exposure and uptake of
transfection reagent-siRNA complexes. Cell densities that are too high may
lead to low transfection efficiencies and little or no silencing. Incubate the
cells overnight. Standard incubation conditions for mammalian cells are 37°C
in 5% CO2. Other cell types, such as insect cells, require different
temperatures and CO2 concentrations that are readily ascertainable by persons
skilled in the art. Use conditions appropriate for the cell type of interest.

c. siRNA re-suspension. Add 20 µl siRNA universal buffer to each siRNA to
generate a final concentration of 50 µM.


d. siRNA-lipid complex formation. Use RNase-free solutions and tubes. Prepare
the mixtures using the following table, Table XI:
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e. 



Table XI

                                              96-well      24-well

Mixture 1 (TransIT-TKO-Plasmid dilution mixture)
  Opti-MEM                                    9.3 µl       46.5 µl
  TransIT-TKO (1 µg/µl)                       0.5 µl       2.5 µl
  Mixture 1 Final Volume                      10.0 µl      50.0 µl

Mixture 2 (siRNA dilution mixture)
  Opti-MEM                                    9.0 µl       45.0 µl
  siRNA (1 µM)                                1.0 µl       5.0 µl
  Mixture 2 Final Volume                      10.0 µl      50.0 µl

Mixture 3 (siRNA-Transfection reagent mixture)
  Mixture 1                                   10 µl        50 µl
  Mixture 2                                   10 µl        50 µl
  Mixture 3 Final Volume                      20 µl        100 µl

  Incubate 20 minutes at room temperature.

Mixture 4 (Media-siRNA/Transfection reagent mixture)
  Mixture 3                                   20 µl        100 µl
  Complete media                              80 µl        400 µl
  Mixture 4 Final Volume                      100 µl       500 µl

  Incubate 48 hours at 37°C.



Transfection. Create a Mixture 1 by combining the specified amounts of OPTI-MEM
serum-free media and transfection reagent in a sterile polystyrene tube. Create a
Mixture 2 by combining the specified amounts of each siRNA with OPTI-MEM media in
sterile 1 ml tubes. Create a Mixture 3 by combining the specified amounts of Mixture 1
and Mixture 2. Mix gently (do not vortex) and incubate at room temperature for 20
minutes. Create a Mixture 4 by adding the specified amount of Mixture 3 to
complete media. Add the appropriate volume to each cell culture well. Incubate cells
with the transfection reagent mixture for 24-72 hours at 37°C. This incubation time is
flexible. The ratio of silencing will remain consistent at any point in the time period.
Assay for gene silencing using an appropriate detection method such as RT-PCR,
Western blot analysis, immunohistochemistry, phenotypic analysis, mass
spectrometry, fluorescence, radioactive decay, or any other method that is now known
or that comes to be known to persons skilled in the art and that from reading this
disclosure would be useful with the present invention. The optimal window for
observing a knockdown phenotype is related to the mRNA turnover of the gene of
interest, although 24-72 hours is standard. Final Volume reflects the amount needed in
each well for the desired cell culture format. When adjusting volumes for a Stock
Mix, an additional 10% should be used to accommodate variability in pipetting, etc.
Duplicate or triplicate assays should be carried out when possible.
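The volume arithmetic in the steps above can be sketched as follows. This is an illustrative calculation only: the function names are hypothetical, the per-well values are the 96-well Mixture 2 amounts read from Table XI, and applying the 10% allowance as a simple multiplier on every component is an assumption of this sketch.

```python
from typing import Dict


def amount_nmol(concentration_uM: float, volume_ul: float) -> float:
    """Amount of solute in nmol for a given concentration (µM) and volume (µl)."""
    # 1 µM x 1 µl = 1 pmol, so divide by 1000 to express the result in nmol.
    return concentration_uM * volume_ul / 1000.0


def final_concentration_nM(stock_uM: float, added_ul: float, final_ul: float) -> float:
    """Concentration in nM after diluting `added_ul` of stock into `final_ul` total."""
    return stock_uM * 1000.0 * added_ul / final_ul


def stock_mix(per_well_ul: Dict[str, float], n_wells: int,
              overage: float = 0.10) -> Dict[str, float]:
    """Scale per-well volumes to a Stock Mix for n_wells, adding a pipetting allowance."""
    if n_wells < 1:
        raise ValueError("n_wells must be at least 1")
    factor = n_wells * (1.0 + overage)
    return {name: round(volume * factor, 2) for name, volume in per_well_ul.items()}


# Step c: 20 µl at 50 µM corresponds to 1 nmol of siRNA.
resuspended = amount_nmol(50.0, 20.0)

# 96-well format: 1.0 µl of 1 µM siRNA ends up in a 100 µl final well volume.
well_concentration = final_concentration_nM(1.0, 1.0, 100.0)

# Mixture 2 (96-well column) scaled to ten wells with the 10% allowance.
mixture_2 = stock_mix({"Opti-MEM": 9.0, "siRNA (1 uM)": 1.0}, n_wells=10)
```

Under these assumptions the 96-well format delivers siRNA at 10 nM in the well, and the 24-well column gives the same concentration because every volume is scaled fivefold.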

While the invention has been described in connection with specific
embodiments thereof, it will be understood that it is capable of further modifications
and this application is intended to cover any variations, uses, or adaptations of the
invention following, in general, the principles of the invention and including such
departure from the present disclosure as come within known or customary practice
within the art to which the invention pertains and as may be applied to the essential
features hereinbefore set forth and as follows in the scope of the appended claims.



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Claims 

1. A method for selecting siRNA comprising selecting an siRNA molecule of
19-25 nucleoside bases, said method comprising:
(a) selecting a target gene;
(b) measuring the functionality of sequences of 19-25 nucleotides in
length that are substantially complementary to a stretch of nucleotides
of the target sequence, wherein said functionality is dependent upon
non-target specific criteria.

2. The method according to claim 1 wherein said functionality is determined
by applying one of the following formulas:

Formula I = -(GC/3) + (AU_{15-19}) - (Tm_{20°C})*3 - (G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3})
+ (U_{10}) + (A_{14}) - (U_{5}) - (A_{11});

Formula II = -(GC/3) - (AU_{15-19})*3 - (G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3});

Formula III = -(GC/3) + (AU_{15-19}) - (Tm_{20°C})*3;

Formula IV = -(GC/2) + (AU_{15-19})/2 - (Tm_{20°C})*2 - (G_{13})*3 - (C_{19}) + (A_{19})*2
+ (A_{3}) + (U_{10}) + (A_{14}) - (U_{5}) - (A_{11});

Formula V = -(G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3}) + (U_{10}) + (A_{14}) - (U_{5}) - (A_{11});

Formula VI = -(G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3});

Formula VII = -(GC/2) + (AU_{15-19})/2 - (Tm_{20°C})*1 - (G_{13})*3 - (C_{19}) + (A_{19})*3
+ (A_{3})*3 + (U_{10})/2 + (A_{14})/2 - (U_{5})/2 - (A_{11})/2;

wherein in Formulas I - VII:

AU_{15-19} = 0-5 depending on the number of A or U bases on the sense
strand at positions 15-19;
G_{13} = 1 if G is the base at position 13 on the sense strand, otherwise its
value is 0;
C_{19} = 1 if C is the base at position 19 of the sense strand, otherwise its
value is 0;
GC = the number of G and C bases in the entire sense strand;
Tm_{20°C} = 1 if the Tm is greater than 20°C;
A_{3} = 1 if A is the base at position 3 on the sense strand, otherwise its
value is 0;
A_{11} = 1 if A is the base at position 11 on the sense strand, otherwise its
value is 0;
A_{14} = 1 if A is the base at position 14 on the sense strand, otherwise its
value is 0;
A_{19} = 1 if A is the base at position 19 on the sense strand, otherwise its
value is 0;
U_{5} = 1 if U is the base at position 5 on the sense strand, otherwise its
value is 0;
U_{10} = 1 if U is the base at position 10 on the sense strand, otherwise its
value is 0;

or,



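Formula VIII: (-14)*G_{13} - 13*A_{1} - 12*U_{7} - 11*U_{2} [...]
9*A_{10} - 9*U_{9} - 9*C_{18} - 8*G_{10} - 7*U_{1} - 7*U_{16} - 7*C_{17} - 7*C_{19}
+ 7*U_{17} + 8*A_{2} + 8*A_{4} + 8*A_{5} + 8*C_{4} + 9*G_{8} + 10*A_{7} + 10*U_{18} + 11*A_{19}
+ 11*C_{9} + 15*G_{1} + 18*A_{3} + 19*U_{10} - Tm - 3*(GC_{total}) - 6*(GC_{15-19})
- 30*X; and

Formula IX: (14.1)*A_{3} + (14.9)*A_{6} + (17.6)*[...]
C_{9} + (23.9)*G_{1} + (16.3)*G_{2} + (-12.3)*A_{11} + (-19.3)*U_{1} + (-12.1)*U_{2}
+ (-11)*U_{3} + (-15.2)*U_{15} + [...]
10.5)*C_{7} + (-13.7)*G_{13} + (-25.9)*G_{19} - Tm - 3*(GC_{total}) - 6*(GC_{15-19})
- 30*X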

wherein
A_{1} = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0;
A_{2} = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0;
A_{3} = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0;
A_{4} = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0;
A_{5} = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0;
A_{6} = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0;
A_{7} = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0;
A_{10} = 1 if A is the base at position 10 of the sense strand, otherwise its value is 0;
A_{11} = 1 if A is the base at position 11 of the sense strand, otherwise its value is 0;
A_{13} = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0;
A_{19} = 1 if A is the base at position 19 of the sense strand, otherwise if another base
is present or the sense strand is only 18 base pairs in length, its value is 0;

C_{3} = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0;
C_{4} = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0;
C_{5} = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0;
C_{6} = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0;
C_{7} = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0;
C_{9} = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0;
C_{17} = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0;
C_{18} = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0;
C_{19} = 1 if C is the base at position 19 of the sense strand, otherwise if another base
is present or the sense strand is only 18 base pairs in length, its value is 0;

G_{1} = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0;
G_{2} = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0;
G_{8} = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0;
G_{10} = 1 if G is the base at position 10 on the sense strand, otherwise its value is 0;
G_{13} = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0;
G_{19} = 1 if G is the base at position 19 of the sense strand, otherwise if another base
is present or the sense strand is only 18 base pairs in length, its value is 0;

U_{1} = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0;
U_{2} = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0;
U_{3} = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0;
U_{4} = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0;
U_{7} = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0;
U_{9} = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0;
U_{10} = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0;
U_{15} = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0;
U_{16} = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0;
U_{17} = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0;
U_{18} = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0;

GC_{15-19} = the number of G and C bases within positions 15-19 of the sense strand
or within positions 15-18 if the sense strand is only 18 base pairs in length;
GC_{total} = the number of G and C bases in the sense strand;
Tm = 100 if the targeting site contains an inverted repeat longer than 4 base pairs,
otherwise its value is 0; and
X = the number of times that the same nucleotide repeats four or more times in a
row.
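Formulas I through VII can be evaluated mechanically from the sense strand. The following is a minimal sketch, assuming a 19-nucleotide sense strand, 1-based positions, and a Tm term that is 0 when the Tm is not greater than 20°C (the definitions only state the value 1). The function name is hypothetical, and the melting temperature is supplied by the caller rather than computed.

```python
from typing import Dict


def score_formulas_i_to_vii(sense: str, tm_above_20c: bool) -> Dict[str, float]:
    """Evaluate Formulas I-VII for a 19-nt sense strand written 5' to 3'."""
    s = sense.upper()
    if len(s) != 19 or any(base not in "ACGU" for base in s):
        raise ValueError("expected a 19-nt RNA sense strand (A, C, G, U)")

    def at(base: str, position: int) -> int:
        # 1 if `base` occupies the given 1-based position, otherwise 0.
        return 1 if s[position - 1] == base else 0

    gc = sum(1 for base in s if base in "GC")              # GC over the whole strand
    au_15_19 = sum(1 for base in s[14:19] if base in "AU")  # A/U count at positions 15-19
    tm = 1 if tm_above_20c else 0                           # assumed 0 otherwise
    g13, c19 = at("G", 13), at("C", 19)
    a3, a11, a14, a19 = at("A", 3), at("A", 11), at("A", 14), at("A", 19)
    u5, u10 = at("U", 5), at("U", 10)

    return {
        "I": -(gc / 3) + au_15_19 - tm * 3 - g13 * 3 - c19 + a19 * 2 + a3
             + u10 + a14 - u5 - a11,
        "II": -(gc / 3) - au_15_19 * 3 - g13 * 3 - c19 + a19 * 2 + a3,
        "III": -(gc / 3) + au_15_19 - tm * 3,
        "IV": -(gc / 2) + au_15_19 / 2 - tm * 2 - g13 * 3 - c19 + a19 * 2
              + a3 + u10 + a14 - u5 - a11,
        "V": -g13 * 3 - c19 + a19 * 2 + a3 + u10 + a14 - u5 - a11,
        "VI": -g13 * 3 - c19 + a19 * 2 + a3,
        "VII": -(gc / 2) + au_15_19 / 2 - tm * 1 - g13 * 3 - c19 + a19 * 3
               + a3 * 3 + u10 / 2 + a14 / 2 - u5 / 2 - a11 / 2,
    }


# SEQ. ID NO. 301, scored with the Tm term set to 0 purely for illustration.
scores = score_formulas_i_to_vii("GGGAGAUAGUGAUGAAGUA", tm_above_20c=False)
```

Formulas VIII and IX are not included because parts of their term lists are not legible in this text.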



3. A method of gene-silencing comprising selecting an siRNA according to
claim 2 and introducing it into a cell.

4. The method according to claim 3 wherein said introducing is by allowing
passive uptake of the siRNA.

5. The method according to claim 3, wherein said introducing is through the
use of a vector.



6. A method for developing an siRNA algorithm for selecting siRNA, said
method comprising:
(a) selecting a set of siRNA;
(b) measuring the gene silencing ability of each siRNA from said set;
(c) determining the relative functionality of each siRNA;
(d) determining the amount of improved functionality by the presence or
absence of at least one variable selected from the group consisting of
the total GC content, melting temperature of the siRNA, GC content at
positions 15-19, the presence or absence of a particular nucleotide at
a particular position and the number of times that the same nucleotide
repeats within a given sequence; and
(e) developing an algorithm using the information of step (d).
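Step (d) can be illustrated by comparing measured silencing between siRNA that have a given feature and siRNA that lack it. This is a sketch under stated assumptions, not the procedure used to derive the formulas above: the difference of group means is chosen here only as a simple example of "amount of improved functionality", the function names are hypothetical, and the silencing percentages are invented for illustration.

```python
from statistics import mean
from typing import Callable, Iterable, Tuple


def feature_effect(measurements: Iterable[Tuple[str, float]],
                   has_feature: Callable[[str], bool]) -> float:
    """Mean silencing of siRNA with the feature minus mean silencing without it."""
    with_feature = []
    without_feature = []
    for sense, silencing in measurements:
        (with_feature if has_feature(sense) else without_feature).append(silencing)
    if not with_feature or not without_feature:
        raise ValueError("both groups need at least one siRNA")
    return mean(with_feature) - mean(without_feature)


def has_a19(sense: str) -> bool:
    """True if A is the base at position 19 of the sense strand."""
    return len(sense) >= 19 and sense[18] == "A"


# Sequences are SEQ. ID NOs. 301-304; the percent-silencing values are hypothetical.
example = [
    ("GGGAGAUAGUGAUGAAGUA", 90.0),
    ("GAAGUACAUCCAUUAUAAG", 70.0),
    ("GUACGACAACCGGGAGAUA", 80.0),
    ("AGAUAGUGAUGAAGUACAU", 60.0),
]
effect_of_a19 = feature_effect(example, has_a19)
```

A positive difference would suggest a positive weight for the feature in step (e), and a negative difference a penalty.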

7. A method of selecting an siRNA with improved functionality, said method 
comprising using the algorithm of claim 6. 

8. A method of selecting hyperfunctional siRNA, said method comprising
using at least one functional siRNA, wherein at least one said functional
siRNA has been selected according to the method of claim 7 and
measuring the silencing ability of said at least one functional siRNA,
wherein silencing ability is measured at a concentration of less than 1
nanomolar siRNA.

9. An siRNA molecule, wherein said siRNA molecule is effective at 
silencing Bcl-2. 

10. The siRNA molecule of claim 9, wherein said siRNA molecule comprises
a sequence substantially similar to a sequence selected from the group
consisting of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301);
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302);
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303);
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304);
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305);
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306);
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307);
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308);
GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310).



11. The siRNA molecule of claim 10, wherein said siRNA molecule
comprises a sequence selected from the group consisting of
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GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310).

12. The siRNA molecule of claim 11, wherein said siRNA molecule
comprises GCAUGCGGCCUCUGUUUGA.

13. The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
a sense strand and an anti-sense strand. 

14. The siRNA molecule of claim 9, wherein said siRNA molecule comprises
a hairpin.

15. The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
between 18 and 30 base pairs. 

16. A kit for gene silencing comprising at least one siRNA selected from the 
group consisting of sequences substantially similar to the group consisting 
of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
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GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 



17. A method of gene silencing comprising using the siRNA molecule of
claim 10.



18. A method of gene silencing comprising using the siRNA molecule of claim 
11. 



19. A kit, wherein said kit is comprised of at least two siRNA, wherein said at
least two siRNA comprise a first optimized siRNA and a second optimized
siRNA, wherein said first optimized siRNA and said second optimized
siRNA are optimized according to one of the following formulas:



Formula I = -(GC/3) + (AU_{15-19}) - (Tm_{20°C})*3 - (G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3})
+ (U_{10}) + (A_{14}) - (U_{5}) - (A_{11});

Formula II = -(GC/3) - (AU_{15-19})*3 - (G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3});

Formula III = -(GC/3) + (AU_{15-19}) - (Tm_{20°C})*3;

Formula IV = -(GC/2) + (AU_{15-19})/2 - (Tm_{20°C})*2 - (G_{13})*3 - (C_{19}) + (A_{19})*2
+ (A_{3}) + (U_{10}) + (A_{14}) - (U_{5}) - (A_{11});

Formula V = -(G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3}) + (U_{10}) + (A_{14}) - (U_{5}) - (A_{11});

Formula VI = -(G_{13})*3 - (C_{19}) + (A_{19})*2 + (A_{3});

Formula VII = -(GC/2) + (AU_{15-19})/2 - (Tm_{20°C})*1 - (G_{13})*3 - (C_{19}) + (A_{19})*3
+ (A_{3})*3 + (U_{10})/2 + (A_{14})/2 - (U_{5})/2 - (A_{11})/2;



wherein in Formulas I - VII:

AU_{15-19} = 0-5 depending on the number of A or U bases on the sense
strand at positions 15-19;
G_{13} = 1 if G is the base at position 13 on the sense strand, otherwise its
value is 0;
C_{19} = 1 if C is the base at position 19 of the sense strand, otherwise its
value is 0;
GC = the number of G and C bases in the entire sense strand;
Tm_{20°C} = 1 if the Tm is greater than 20°C;
A_{3} = 1 if A is the base at position 3 on the sense strand, otherwise its
value is 0;
A_{11} = 1 if A is the base at position 11 on the sense strand, otherwise its
value is 0;
A_{14} = 1 if A is the base at position 14 on the sense strand, otherwise its
value is 0;
A_{19} = 1 if A is the base at position 19 on the sense strand, otherwise its
value is 0;
U_{5} = 1 if U is the base at position 5 on the sense strand, otherwise its
value is 0;
U_{10} = 1 if U is the base at position 10 on the sense strand, otherwise its
value is 0;



Formula VIII: (-14)*G_{13} - 13*A_{1} - 12*U_{7} - 11*U_{2} [...]
9*A_{10} - 9*U_{9} - 9*C_{18} - 8*G_{10} - 7*U_{1} - 7*U_{16} - 7*C_{17} - 7*C_{19}
+ 7*U_{17} + 8*A_{2} + 8*A_{4} + 8*A_{5} + 8*C_{4} + 9*G_{8} + 10*A_{7} + 10*U_{18} + 11*A_{19}
+ 11*C_{9} + 15*G_{1} + 18*A_{3} + 19*U_{10} - Tm - 3*(GC_{total}) - 6*(GC_{15-19})
- 30*X; and


Formula IX: (14.1)*A_{3} + (14.9)*A_{6} + (17.6)*[...]
C_{9} + (23.9)*G_{1} + (16.3)*G_{2} + (-12.3)*A_{11} + (-19.3)*U_{1} + (-12.1)*U_{2}
+ (-11)*U_{3} + (-15.2)*U_{15} + [...]
10.5)*C_{7} + (-13.7)*G_{13} + (-25.9)*G_{19} - Tm - 3*(GC_{total}) - 6*(GC_{15-19})
- 30*X

WO 2004/045543 PCT/US2003/036787

wherein
A_{1} = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0;
A_{2} = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0;
A_{3} = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0;
A_{4} = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0;
A_{5} = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0;
A_{6} = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0;
A_{7} = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0;
A_{10} = 1 if A is the base at position 10 of the sense strand, otherwise its value is 0;
A_{11} = 1 if A is the base at position 11 of the sense strand, otherwise its value is 0;
A_{13} = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0;
A_{19} = 1 if A is the base at position 19 of the sense strand, otherwise if another base
is present or the sense strand is only 18 base pairs in length, its value is 0;

C_{3} = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0;
C_{4} = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0;
C_{5} = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0;
C_{6} = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0;
C_{7} = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0;
C_{9} = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0;
C_{17} = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0;
C_{18} = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0;
C_{19} = 1 if C is the base at position 19 of the sense strand, otherwise if another base
is present or the sense strand is only 18 base pairs in length, its value is 0;

G_{1} = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0;
G_{2} = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0;
G_{8} = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0;
G_{10} = 1 if G is the base at position 10 on the sense strand, otherwise its value is 0;
G_{13} = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0;
G_{19} = 1 if G is the base at position 19 of the sense strand, otherwise if another base
is present or the sense strand is only 18 base pairs in length, its value is 0;

U_{1} = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0;
U_{2} = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0;
U_{3} = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0;
U_{4} = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0;
U_{7} = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0;
U_{9} = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0;
U_{10} = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0;
U_{15} = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0;
U_{16} = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0;
U_{17} = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0;
U_{18} = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0;

GC_{15-19} = the number of G and C bases within positions 15-19 of the sense strand
or within positions 15-18 if the sense strand is only 18 base pairs in length;
GC_{total} = the number of G and C bases in the sense strand;
Tm = 100 if the targeting site contains an inverted repeat longer than 4 base pairs,
otherwise its value is 0; and
X = the number of times that the same nucleotide repeats four or more times in a row.
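The sequence-composition terms shared by Formulas VIII and IX (GC_{total}, GC_{15-19} and X) can be computed directly from the sense strand. This is a minimal sketch with hypothetical function names; counting each maximal run of four or more identical nucleotides once is an assumed reading of the definition of X. The Tm term (inverted repeats) is not covered.

```python
import re


def gc_total(sense: str) -> int:
    """Number of G and C bases in the sense strand."""
    return sum(1 for base in sense.upper() if base in "GC")


def gc_15_19(sense: str) -> int:
    """Number of G and C bases at positions 15-19 (15-18 for an 18-nt strand)."""
    # Slicing stops at the end of the strand, so an 18-mer covers positions 15-18.
    return sum(1 for base in sense.upper()[14:19] if base in "GC")


def homopolymer_runs(sense: str, min_run: int = 4) -> int:
    """Count maximal runs of at least `min_run` identical nucleotides (the X term)."""
    # (.)\1{3,} matches one base followed by three or more copies of itself;
    # matches are greedy and non-overlapping, so each run is counted once.
    pattern = r"(.)\1{%d,}" % (min_run - 1)
    return sum(1 for _ in re.finditer(pattern, sense.upper()))


# SEQ. ID NO. 301 contains no run of four identical bases.
seq_301 = "GGGAGAUAGUGAUGAAGUA"
terms = (gc_total(seq_301), gc_15_19(seq_301), homopolymer_runs(seq_301))
```

For a hypothetical strand such as GGGGAUCUUUUUACG, the same function counts two runs (GGGG and UUUUU).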
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